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This non-provisional application takes priority from U.S. Provisional Application 
Number 60/234,015 filed on September 20 th , 2000. 

FIELD OF THE INVENTION 

This invention relates to the field of computer software. More specifically, the 
invention relates to a method and apparatus for dynamically formatting and displaying 
tabular data in real time. 

Portions of the disclosure of this patent document contain material that is subject 
to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure as it appears in 
the Patent and Trademark Office file or records, but otherwise reserves all copyrights 
whatsoever. 

BACKGROUND 

Presenting data records in tabular form (e.g., as a set of rows each with the same 
number and types of column) is a well-known way to compactly represent large 
quantities of information. As a result, people frequently present data records (e.g., 
printed or displayed) using tables to convey different kinds of information. Product 
catalogs, for example, typically contain a large number of tables representative of the 
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various product alternatives available in the catalog. The following example illustrates 
the effectiveness of using a table to describe multiple aspects of a product line. 
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Price 1 
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F-35-100-12 
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F-35-200-12 
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24 
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$3.75 


F-35-400-24 
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36 


F-35-1 00-36 


$4.25 


F-35-200-36 


$4.25 


F-35-400-36 


$4.25 



Computer software programs such as Quark Express™, PageMaker™, Microsoft 
Word™, Microsoft Excel™, and many others provide mechanisms for generating such 
tables. However, the approach these program use to manipulate table data is 
cumbersome and lacks flexibility. Although it is possible to generate and format tables 
using these programs, the process for doing so is unnecessarily laborious. For example, 
in most case users are required to manually provide numerous commands relating to 
each cell, row, or column of information in the table. Any page layout having a table 
must be meticulously laid out with existing page layout programs a page at a time and 
formatted a table at a time by manually populating page layouts with product data, a . 
process that is time-consuming, tedious and very, very expensive. There is also no easy 
way to experiment with different tabular layout formats and views of the data, and 
once a page has been laid out, it is difficult to add or remove records from the tables 
without destroying the structure of the page and requiring that it be laid out again 
(sometimes from scratch) which discourages updates and means that catalog pages tend 
to quickly become out-of-date. 
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The upside of this complex process, however, is that manual page layout usually 
results in high page density, flexible and well-structured tabular layout formats using 
pivots to eliminate redundant information, and a very high overall standard of quality. 
Notwithstanding the high level of quality, however, it remains difficult to enforce a 
uniform look throughout a publication because more than one person is usually 
involved in the page layout process, and each lays out pages somewhat differently. If 
formatting changes are made (e.g., a row or column is moved, filtered, or sorted), the 
changes become permanent once the file is saved into persistent memory. Some 
programs have an undo command that enables users to undo formatting commands in 
sequence while the document is still in transient memory (e.g., RAM), but the undo 
command cannot remove commands out of sequence nor can it operate once the file is 
saved into persistent memory. 

By contrast, electronic catalog pages are typically database-driven and generated 
programmatically in real-time. Since page layouts do not actually exist until the 
electronic catalog page is displayed, new products can be added and old products 
removed without disturbing the system or the published output. Unfortunately, the 
downside of this flexibility is that automatically generated electronic catalog pages are 
usually no more than wide, ugly, "spreadsheet-style" tables of data with redundant 
information, very little structure, and none of the sophisticated tabular layout formats 
that are standard for paper pages. With category-specific attributes and a large number 
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of categories, it is even more impractical to have a customized hand-coded display for 
each family, so generic unstructured presentations are even more the norm. 

Moreover, when publishing to multiple media, none of the effort invested in 
meticulously laying out paper pages can be leveraged for the electronic catalog, since 
both the structure of the tabular layout formats as well as the product data are typically 
trapped within the page layout itself, while the electronic catalog requires that the data 
be stored and managed in a database to be searchable and generated in real-time. Thus 
the worlds of the two media are completely distinct and non-overlapping, very difficult 
to integrate, and require two distinct publishing efforts. 

Another problem existing programs for manipulating table data have is that 
these programs lack a mechanism for dynamically formatting table data in real time. 
For example, existing programs do not automatically modify the layout of a table in real 
time upon receipt of formatting commands from the user. Thus, users cannot 
instantaneously view changes made to the table at the time such changes are made. 
This limits the users ability to incrementally change the various formatting options until 
a satisfactory visual appearance is achieved. 

Many database programs have the ability to present different views of data. For 
instance, most Relational Database Management Systems (RDBMS) include report 
writers that provide a platform for publishing information stored in the database as 
formatted presentations with simple tabular layout formats. Those of ordinary skill in 
the art are familiar with techniques for instructing such report writers to combine 
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information from records in multiple tables, format the common information associated 
with each family in a structured way, and finally sort the records of tabular information. 
This approach works well with a relational database in which the field structure and the 
set of fields is consistent across the entire set of records, the field definitions are 
relatively static, and the number of fields is limited; because each field applies across 
the entire database, special handling and formatting for a particular field or fields is 
coded only once rather than multiple times. 

By contrast, in a database with category-specific attributes, the field structure 
and set of fields differs for the records of each category. Thus, the report writer needs to 
be coded with specific intelligence about how to handle attributes on a category-by- 
category basis. If there are a large number of categories and attributes, this can be 
extremely tedious, time-consuming, and error prone to implement. The report writer 
must then be recoded each time changes are made to the taxonomy structure (e.g., 
organizational structure) and/or the set of attributes associated with each category, 
which makes the report writer approach difficult to maintain as the taxonomy changes 
over time. 

Moreover, using existing report writers (or HTML) to present and structure 
tabular information more efficiently using pivot columns is a manual process that 
requires programming expertise that is beyond the capability of most average users. It 
is also data-dependent, so that even if the report writer can be used to create the pivot 
tables, the code then needs to be rewritten to reflect the data each time changes are 
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made to the underlying records. Moreover, if each family requires a different tabular 
layout format because of category-specific attributes, then the particular tabular layout 
format for each family must be individually coded in the report writer, substantially 
increasing the coding complexity. 

The issues identified above become particularly problematic when publishing 
catalogs that contain tables of product information. The manner in which catalogs are 
typically published is a laborious process that involves the manual entry of data and 
layout of the catalog. Such processes keep the cost of producing catalog or any other 
publication unnecessarily high. Thus there is a need for a system that dramatically 
reduces the cost of laying out tabular data by flexibly, programmatically, and 
automatically generating page layouts in real time. 



6 



WO 02/25500 



PCT/US01/29486 



SUMMARY OF THE INVENTION 



Embodiments of the invention improve upon current systems by allowing users 
to dynamically generate and repeatedly modify the appearance of any set of tabular 
data. When the system obtains input relating to formatting the table, the appearance of 
the table is dynamically modified so the users can instantaneously view any changes to 
the table caused by the input (e.g., WYSIWYG). The system accepts various types of 
input and upon receipt of that input the system changes the appearance of the table in 
accordance with the input provided. Thus, the user may repeatedly modify the table by 
providing different or additional input and viewing the results of the input. This 
enables users to fluidly add, remove, and/or otherwise manipulate layout values 
associated with the table. 

In one embodiment of the invention, the user input (e.g., layout information) 
relates to various types of pivot operations, sorting operation, and/or merging 
operations performed on the table. The user may, for example, select a certain field and 
then initiate a pivot operation using the selected field. The system is configured in 
accordance with one embodiment of the invention so that the layout information is 
stored independent of the table data. This arrangement enables the system to apply the 
layout information to any set of tabular data, even if that data was not used to 
determine the layout. Additionally, the system can manipulate the tabular data set 
without changing the underlying structure of the data. The layout information can also 
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be associated with or dependent on a particular set of tabular data and stored along 
with that data. In this instance, the layout information is part of a file or set of files 
related to the tabular data. 
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DESCRIPTION OF THE DRAWINGS 

Figure 1 is a flow chart that illustrates the process for enabling systems to 
implement one or more embodiments of the invention. 

Figure 2 illustrates specific types of layout information in accordance with one or 
more embodiments of the invention. 

Figure 3 illustrates the components of a graphical user interface configured in 
accordance with an embodiment of the invention. 

Figure 4 illustrates a tabular data set before the execution of any pivot operations 
in accordance with an embodiment of the invention. 

Figure 5 is a flow charts that illustrates the functions executed when a stack 
pivot, horizontal pivot, or vertical pivot is requested in accordance with an embodiment 
of the invention. 

Figure 6 illustrates a tabular data set in accordance with an embodiment of the 
invention after a stack pivot is performed in accordance with an embodiment of the 
invention. 

Figure 7 illustrates a tabular, data set after a stack pivot and horizontal pivot is 
performed in accordance with an embodiment of the invention. 
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Figure 8 illustrates a tabular data set after a stack pivot, horizontal pivot, and 
vertical pivot is performed in accordance with an embodiment of the invention. 



Figure 9 is a generalization of several pivot operations in accordance with an 

embodiment of the invention. 
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DETAILED DESCRIPTION 

An embodiment of the invention comprises a method and apparatus for 
dynamically formatting any type of table data that further extends upon current 
systems by providing users with a flexible interface for manipulating the table data in 
real time. In the following description numerous specific details are set forth in order to 
provide a more thorough understanding of the present invention. It will be apparent, 
however, to one skilled in the art, that the present invention may be practiced without 
these specific details. In other instances, well-known features have not been described 
in detail so as not to obscure the invention. The reader should note that although 
certain details are set for the herein, the claims and the full scope of any equivalents is 
what defines the invention. 

System Overview: 

Embodiments of the invention improve upon current systems by allowing users 
to dynamically generate and repeatedly modify the appearance of any set of tabular 
data. If the tabular data comprises records having a defined association, the appearance 
of the table can be modified while still maintaining the integrity of the relationships 
between fields and/or attributes in the table. For instance, if the data contained in a 
table represents a set of records sharing a common value (e.g., a field and/or attribute), 
the relationship to the common value can be maintained while still allowing users to 
freely modify the appearance of the table. However, the invention is not limited to 
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manipulating records sharing a common value and systems embodying the invention 
can also manipulate table data that does not have any defined relationships. The 
invention therefore has applicability across multiple types of computer programs (e.g., 
word processing, spreadsheets, databases, desktop publishing, and any other program 
where visually manipulated tabular data is desirable). 

When the system obtains input relating to formatting the table, the appearance of 
the table is dynamically modified so the users can instantaneously view any changes to 
the table caused by the input (e.g., WYSIWYG). The system accepts various types of 
input and upon receipt of that input the system changes the appearance of the table in 
accordance with the input provided. Thus, the user may repeatedly modify the table by 
providing different or additional input and viewing the results of the input. This 
enables users to fluidly add, remove, and/ or otherwise manipulate layout values 
associated with the table. 

In one embodiment of the invention, the user input (e.g., layout information) 
relates to various types of pivot operations, sorting operation, and/or merging 
operations performed on the table. The user may, for example, select a certain field and 
then initiate a pivot operation using the selected field. The specifics of each pivot 
operation are discussed in further detail below. However, generally speaking the goal 
of each pivot operation is to reduce the amount of redundant information shown in the 
table by manipulating the data contained therein. 
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The system is configured in accordance with one embodiment of the invention so 
that the layout information is stored independent of the table data. This arrangement 
enables the system to apply the layout information to any set of tabular data, even if 
that data was not used to determine the layout. Additionally, the system can 
manipulate the tabular data set without changing the underlying structure of the data. 

The process for enabling system to implement one or more embodiments of the 
invention is illustrated in Figure 1. The process initiates when a computer obtains a 
group of records for purposes of formatting (see e.g., Figure 1, step 100). These records 
may or may not be related by at least one common value. When records in a table are 
related by one or more common values, the group of records is referred to as a family. 
The invention contemplates the use of families defined in many different ways. Some 
examples, of families utilized by embodiments of the invention are further described in 
co-pending patent application entitled "DATA INDEXING USING BIT VECTORS" U.S. 
serial number 09/643,316 which is incorporated herein by reference. Further detail 
about the hierarchical structure associated with a family can also be found in the co- 
pending patent application entitled "METHOD AND APPARATUS FOR 
STRUCTURING, MAINTAINING, AND USING FAMILIES OF DATA", which is also 
incorporated herein by reference. 

Once the group of records is obtained by the system (see e.g., step 100) a visual 
representation of the records can be presented to the user for modification. For 
instance, the system may display a subset of the group of records (e.g., step 102) that 
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may include a visual representation of the records associated with the entire family or a 
portion of the family. So that the user can view the layout of the group of records in a 
tabular format, the records are initially presented in a table. Upon viewing the group of 
records the user may provide the system with layout information (e.g., step 104). In one 
embodiment of the invention, the layout information is utilized to rearrange or modify 
the data in the table. This layout information can be stored independent of the data 
contained in the table. The layout information is associated with the table data in that 
the user may select field names or attributes names that are utilized as the values for 
performing various layout operations. However, the layout information can be 
generalized so that the operations defined therein can be applied to any set of tabular 
data regardless of the field and/or attributes values of the data. 

Some examples of the types of operations defined in the layout information 
include pivot operation such as stack pivots, horizontal pivots, vertical pivots, sorting 
information, merging information, inheritance properties, and field and/or attributes 
values to be hidden from view. If, for example, the user selects a field name the selected 
field name can be used as the basis for one or more pivot operations. The specific 
characteristics of each pivot operation will be described in further detail below. 

Once the system obtains the layout information, the table is dynamically updated 
to reflect the changes made (see e.g., steps 106 and 108). The user may continue to add, 
remove, or otherwise modify the layout information (see e.g., steps 110) in order to 
determine how such modifications change the view of the table. The updated table 
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(which may be referred to as a preview table), may contain a less redundant set of 
records than the initial table. Certain records having redundant information may be 
repositioned, merged together, or removed from the table. In the event that providing 
certain layout information makes the table more complex or confusing to the user, the 
user can dynamically modify the table by removing the layout information that added 
such complexity. Once the user determines that additional layout information is not 
desirable, the layout information obtain from the user to generate the preview table can 
be saved and optionally associated with the appropriate group of records. This way the 
user can easily recreate the table without having to provide the same layout information 
again. If the preview table is to be used by another program (e.g., a publication 
program), the table can be exported to that program. If, for instance, the user is 
designing tables for a catalog, the user can provide the finalized table to the catalog 
publication program. If the table is not in a satisfactory format, the user can continue 
modifying the table while dynamically viewing the changes until a satisfactory result is 
achieved. 

Figure 2 illustrates specific types of layout information in accordance with one or 
more embodiments of the invention. For example, layout information may comprise 
inheritance properties, pivot values, hidden values, sorting information, merging 
information, and other information (e.g., steps 226 & 228) relating to the appearance of 
the table. Inheritance properties are typically supplied when the group of records 
represented in the table are arranged in a hierarchical structure (e.g., step 200). In such 
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instances, inheritance properties can be defined on a node-by-node basis. Quid nodes, 
for instance, may inherit from higher nodes in the hierarchy. When inheritance 
properties are supplied, systems embodying the invention take into account the 
inheritance properties when dynamically generating the preview table (e.g., step 202). 
As is the case with a family partitioning hierarchy, all nodes that are children of a 
parent node in the extended taxonomy inherit the layout structure defined for each 
node. The inherited structure can be overridden on a node-by-node basis so that 
different children of a category can have different pivoting, sorting, display sequence, 
and other pivot-specific sorting and display characteristics. Therefore, inheritance 
properties provide user with a mechanism for identifying on a node-by-node basis 
whether that node should inherit any properties. 

If one or more pivot values are selected, pivot operations in accordance with the 
pivot values are executed (see e.g., steps 204 - 216). Any field and/or attribute 
associated with the group of records in the table can become a pivot value. When a 
particular field and/or attribute is identified as a pivot value that value is used during 
the pivot operation. For example, if a stack pivot value is identified, a stack pivot 
operation is executed (e.g., steps 206, 208). When a horizontal pivot value (e.g., step 210) 
or a vertical pivot value (e.g., step 214) is selected, a horizontal pivot (e.g., step 212) or 
vertical pivot (e.g., step 214) operation executes. Layout information in accordance with 
systems embodying the invention may also comprise sorting information and merging 
information. The sorting information identifies the order of sequence of records to be 
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shown in the preview table. Merging information directs the system to combined 
records having identical values. For instance, if a plurality of records in a column 
contain the value "1 inch", the cells in the column can be merged into a single column. 



The following table illustrates a merging operation: 



Brush Size 


Brush Types 


Price 


1 inch 


Sponge brush 


$10.00 


1 inch 


Paint brush 


$7.00 


May become: 


Brush Size 


Brush Types 


Price 


1 inch 


Sponge brush 


$10.00 


Paint brush 


$7.00 



The system also provides a mechanism for identifying hidden values. A value that is 
hidden may be used to perform an operation, but is not shown in the preview table 
output. User may elect to have individual values, fields, attributes, columns or rows 
hidden. 

System Interface / Functionality: 

Systems embodying the invention may contain a graphical user interface that 
enables users to generate and dynamically modify the preview table in real-time. Thus, 
the graphical user interface offers users a WYSIWYG system that automatically 
generates and displays previews of the tabular layout formats (e.g. preview tables) 
based on the layout specifications (e.g., layout information). This is accomplished 
without a report writer or any Hypertext Markup Language (HTML) coding. As Figure 
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3 illustrates, graphical user interface 301 comprises layout portion (300) where the 
records of a table to be manipulated are displayed. Layout portion 300 contains the 
records of a table displayed in tabular form along with the participating fields and/or 
attributes. In some instances (e.g., when the user or system identifies certain fields or 
attributes as hidden fields), not all of the fields or attributes associated with the table are 
shown. The versions that are displayed may therefore have associated fields or 
attributes that are not visibly displayed, but that are part of the tabular data. 

The reader should note that in some instances layout portion 300 contains family 
data, but the invention is not limited to the display of family data. Layout portion can 
contain any type of table data arranged in columns and rows. For instance, layout 
portion 300 may comprise any type of tabular data whether that data is related or 
unrelated to other records in the table. Thus, an embodiment of the invention does not 
require that the table data share a common value. 

The user provides layout information utilized by the system to perform various 
formatting operations upon the tabular data shown in layout portion 300. The reader 
should note that systems embodying the invention do not require the tabular data 
shown in layout portion 300 be visible prior to performing manipulations on the data 
contain therein. Graphical user interface 301 comprises a location for defining layout 
detail (e.g., layout component 302). Layout information comprises any data or 
information that relates to changing the appearance or arrangement of tabular data. For 
example, layout information may comprise inheritance information, pivot values, 
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hidden values, sorting information, merging information, or any other types of data 
relating to modifying the appearance of tabular data. Layout information may associate 
any value such as fields and/or attributes shown in list 304 with an operation (e.g., 306) 
to be performed. For instance, the user may associated one or more field and/or 
attribute from the tabular data with a pivot operation (e.g., stack pivot, horizontal pivot, 
or vertical pivot). 

In particular, the layout information may identify: (a) the fields and/or attributes 
on which to pivot the resulting sub-tables of records, reducing redundant information 
in each of the sub-tables; (b) the fields and/or attributes by which to sort the records in 
each sub-table; (c) the fields and/or attributes that should not be displayed in the 
published output; (d) the display sequence of the fields and /or attributes that have not 
been hidden nor used to pivot; and (e) pivot-specific sorting and display information to 
be applied on a pivot-by-pivot basis. This layout specification is performed and stored 
in accordance with one embodiment of the invention on a family-by-family basis so that 
not only fields but also category-specific attributes can be used to define the pivoting, 
sorting, display sequence, and other pivot-specific sorting and display characteristics 
for each family. Multiple pivots of the same type can be nested, while pivots of 
differing types can be combined. In one embodiment of the invention, the taxonomy 
structure of the underlying data is extended for each family to include such layout 
specifications. 
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Dynamic Real-Time Previews: 

As the layout information is obtained, the system automatically applies the 
layout information to the tabular data thereby updating the tabular data shown in 
layout portion 300 in real time. The updated table (which may be referred to as a 
preview table) provides the user with instantaneous interactive feedback as to the 
effects of the layout information. The user may obtain further feedback by iteratively 
revising the layout information and allowing the system to dynamically adjust the 
preview to account for the modified layout information. Thus, embodiments of the 
invention provide a mechanism for obtaining layout information that relates to a set of 
tabular data and instantaneously generating a corresponding preview in real time, 
thereby providing instant interactive feedback to the user. The user may continue to 
iteratively refine the layout information by tweaking and fine-tuning that information 
until the appearance of the preview table is satisfactory. 

Layout Storage: 

In one embodiment of the invention, the layout information (e.g., structure / 
formatting data) is stored independent from the tabular data itself and/or the 
partitioning hierarchy that further extends upon the underlying taxonomy structure. 
Thus, the user may apply the layout information to different sets of tabular data 
without having to redefine the layout of that data. Storing the layout information 
independent of the data also allows the system to flexibly modify the appearance of the 
data without changing the underlying relationships or integrity of the data from which 
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the tabular data is derived. If, for example, the tabular data is obtained from a database, 
the referential integrity of the database need not be altered even though the appearance 
of the data is modified. When a partitioning hierarchy is utilized to define the tabular 
data to be manipulated, the hierarchy is not altered even though the layout of the 
tabular data is changed. 

Storing layout information in this manner results in a system in which a report 
writer (or HTML) requires no complex code for pivoting tabular layout formats, no 
special coding for each category or family, and no intelligence about the underlying 
data. Instead, everything is driven by the extended taxonomy structure, and changes 
that occur in the taxonomy as well as the underlying records themselves can be 
immediately reflected in the output. In effect, the intelligence about how to layout and 
format the records in each family are built into the taxonomy itself rather than into 
special category- and family-specific programming code in the report writer. The 
reader should note that layout information may also be stored in a way that is directly 
associated with the tabular data to which it relates. In this instance, the tabular data 
and the layout information can be part of the same file or in separate files that are 
related to one another. 

Pivot Operations: 

Figure 4 illustrates a table of data 400 (e.g., family data) before the execution of 
any pivot operations. The available fields and/or attributes 402 are shown, as are the 
various layout operations 404 that can be performed on the data. Inheritance properties 
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408 are also shown. When the table contains family data, the family hierarchy showing 
the category and the value upon which a partition was made may be shown. The 
reader should note, however, that aspects of the invention are applicable to the 
manipulation of any type of tabular data and that the invention is not limited to 
formatting family data, but provides a mechanism for dynamically formatting any type 
of table data in real-time. 

Figure 5 illustrates the functions executed when a stack pivot, horizontal pivot, 
or vertical pivot is requested. When the system performs a pivot operation, the values 
utilized for each pivot operation are typically obtained from the tabular data to be 
manipulated (see e.g., Figure 5, step 500 and tabular data 300). The purpose of each 
pivot operation is to reduce the amount of redundant information that ultimately ends 
up in the preview table. The user may elect to hide the pivot values so that the 
information relating to such values is not shown in the preview table. Although there 
are several types of pivot operations, generally speaking a pivot operation is performed 
by identifying a pivot axis (e.g., a column or row) in a table that corresponds to the 
identified pivot value (e.g., steps 502 and 504). The pivot axis is then removed from the 
table (e.g., step 506) and the system generates a preview table by breaking the preview 
table to sub-tables based on the pivot axis (e.g., step 508). The group of records in the 
table may then be sorted into sub-tables based on the pivot value of the pivot axis. 

Figures 6-8 illustrate a few specific examples of preview table as different types 
of pivot operations are dynamically applied to the tabular data. 
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A) Stack Pivot 

One type of pivot operation is referred to as a stack pivot (e.g., step 510). The 
stack pivot recombines the sub-tables into the preview table in a vertical arrangement. 
Optionally, the system may add an additional row to the preview table that contains the 
pivot value, preserve each of the sub-tables, and label the sub-tables with at least one 
pivot value (e.g., step 512). Each pivot operation can be nested within another pivot 
operation. Thus, multiple pivots can be performed. An additional stack pivot, for 
example, could be nested within the initial stack pivot. 

Figure 6 illustrates the table shown in Figure 4 after a stack pivot is performed. In 
this example, the user selected the fields / attributes of "main picture" and "film type" 
as pivot values 600. Upon selection of those values the system automatically applies a 
stack pivot operation against table 400 thereby transforming the table into preview table 
602. Thus, preview table 602 now contains values representative of film, made by 
Kodak™, and separated by film type. 

B) Horizontal Pivot 

An additional type of pivot operation is referred to as a horizontal pivot (e.g., 
step 514). The horizontal pivot recombines the sub-tables into the preview table by 
arranging the sub-tables horizontally. The system may also add an additional row to 
the preview table that contains the pivot value used to perform the pivot and use that 
row to label the sub-tables (e.g., step 516). Multiple horizontal pivots can be performed 
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and such pivots can be combined with other pivot operations. For instance, the system 
may perform a stack pivot and horizontal pivot on the same data. Horizontal pivots can 
also be nested within other horizontal pivots- 
Figure 7 illustrates the table shown in Figure 6 after a subsequent horizontal 
pivot is performed using "film speed" as the horizontal pivot value 700. The horizontal 
pivot is applied to preview table 602 in real-time upon selection of the horizontal pivot 
value thereby automatically transforming preview table 602 into preview table 702. 
Preview table 702 now shows the different film types separated by film speed. 

C) Vertical Pivot 

A vertical pivot recombines the sub-tables into the preview table by arranging 
the sub-tables vertically (e.g., steps 518 and 520). Optionally, the system may add an 
additional column containing one or more pivot values for purposes of labeling a group 
of rows in the tables that make up each sub-table. 

Figure 8 illustrates Figure 7 after a vertical pivot operation is executed. The pivot 
value utilized to minimize the redundancy in preview table 702 is vertical pivot value 
800. Using "exposures" as the vertical pivot value. Upon identification of "exposures" 
as the field to use for the vertical pivot operation, the system in accordance with one or 
more embodiments of the invention transforms preview table 702 into preview table 
802. Preview table 802 conveys the same information originally shown in table 400 but 
has minimized the amount of redundant or irrelevant information in the table. Once 
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the table is manipulated to the point where the user is satisfied with the appearance of 
the table it can be optionally output to another computer program or provided to a 
publication mode. 

Figure 9 is a generalization of several pivot operations in accordance with an 
embodiment of the invention. Block 900 illustrates a stack pivot, block 902 illustrates a 
vertical (row) pivot), and block 904 illustrates a horizontal (column) pivot. 

Publication: 

When the appearance of the table is finalized, the layout information is saved. A 
table generated using that layout information may be provided to a publication 
program for further processing. The layout information may be applied against 
multiple sets of data and revised using the process described herein. 
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Section A 

General Terminology 

Database 

■ A database is a logical collection of interrelated information, managed and 
stored as a unit. 

■ A record is a representation of a real-world object such as a person, a product, or 
a company. A record consists of one or more individual data elements. 

■ A field describes one of the data elements of a record and is common to all the 
records in a table. 

■ A table is a simple, rectangular, row/column arrangement of related data values. 

■ Each horizontal row in the table represents a single record and consists of the 
same set of fields. 

■ Each vertical column of the table represents one field that is stored for each row 
in the table. 

■ A cell is the intersection of a row and a column in a table and contains the data 
value for a particular field of a particular record. 

■ A relational database is a database in which all data is organized into tables that 
may be related by matching columns. 
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■ A relational database management system (RDBMS) is a software system that 
stores and retrieves data in a relational database. 

■ A lookup uses a pair of matching columns from two tables, taking the value of 
the column for a single record in the first primary table to "look up" additional 
information in a single corresponding record in the second lookup table. 

■ A join combines information from two tables by performing a lookup on every 
record of the primary table. 

■ Value limiting on a lookup table reduces the set of lookup values by eliminating 
from the set of all possible lookup values those values that do not correspond to 
any records in the primary table. 

Hierarchy 

■ A hierarchy is a table in which the records have parent/child relationships. 

■ A node is another term for a record in a hierarchy. 

■ The root node of a hierarchy is a node that has no parent. 

■ An internal node of a hierarchy is a node that has at least one child. 

■ A leaf node of a hierarchy is a node that has no children. 



27 



WO 02/25500 



PCTYUS01/29486 



Attributes 

■ An attribute is a data element that is not common to all the records in a table. 

■ A category is a subset of the records of a table that has a set of common 
attributes. Each record in a table must belong to exactly one category. 

■ A taxonomy is the partitioning of a table and its records into multiple categories, 
with or without hierarchy, along with the assignment of attributes to each of the 
categories. 

Families and Pivots 

■ A family is a group of records in a table which are related by one or more 
common fields and/or attributes that have the same value, and which may also 
have additional fields of common information, such as an image, a logo, a 
paragraph of descriptive text, bullets of specifications, and so on. 

■ A presentation is a formatted family layout consisting of both the common 
information and the tabular information for the group of related family records. 

■ A partition is the division of a group of records into one or more subgroups, 
each of which is defined by the set of records from that group that have a fixed 
set of values for one or more fields and/or attributes. The partition is specified 
by the set of fields and/or attributes whose values or value combinations will 
define the subgroups. 
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■ The partitioning table is the main table of records that is to be divided into 
partitions. 

■ A partitioning hierarchy of a partitioning table is a hierarchy in which the nodes 
of the hierarchy represent partitions of the partitioning table. 

■ A partitioning node is a node in the partitioning hierarchy that corresponds to a 
particular family of records. Since a partition simply divides a group of records 
into sub-groups, the set of records represented by a partitioning node is exactly 
the set of records represented by combining the sets of records represented by 
each of the descendants of that partitioning node. The root partitioning node (or 
root partition) represents the entire set of records of the partitioning table; each 
sub-node represents only those records which have the fixed set of field and /or 
attribute values defined by the partitions starting at that sub-node and tracing 
ancestors back up to the root; the entire set of leaf partitioning nodes (or leaf 
partitions) represents the entire set of records; and each record of the 
partitioning table belongs to one and only one leaf partitioning node. 

■ A base family is a family that corresponds to a leaf partitioning node. 

■ The base family set is the complete set of base families that corresponds to the 
complete set of leaf partitions in a partitioning hierarchy. The base family set is 
useful because each record of the partitioning table belongs to exactly one base 
family. 
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■ A pivot reduces redundant information in a table of records by restructuring the 
table using one or more pivot columns. Specifically, it eliminates the pivot 
columns from the table, sorts and groups the records into multiple sub-tables 
based on the value or value combinations of these columns, and then labels each 
sub-table with these pivot values. Pivots are similar to partitions in that they 
divide a group of records into sub-groups. However, the sub-groups created by a 
partition are families, while the sub-groups created by pivots are used to control 
the layout of a particular family. Specifically, there are three types of pivots, each 
of which controls the layout of a family by affecting the arrangement of the 
resulting sub-tables and the corresponding placement of the pivot values. 

■ A stack pivot (or depth pivot) recombines each of the resulting sub-tables into a 
single table by arranging them vertically on top of each other, and adds an 
additional row containing the pivot values before each sub-table. Alternatively, 
each of the resulting sub-tables can be preserved, and each simply labeled with 
the pivot values. 

■ A row pivot (or vertical pivot) recombines each of the resulting sub-tables into a 
single table by arranging them vertically on top of each other, and adds an 
additional column containing the pivot values to label the group of rows 
comprising each sub-table. 

■ A column pivot (or horizontal pivot) recombines each of the resulting sub-tables 
into a single table by arranging them horizontally side-by-side, and adds an 
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additional row containing the pivot values to label the set of columns comprising 
each sub-table. Multiple pivots of the same type can be nested, while pivots of 
differing types can be combined. 
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Structure for Efficient Storage of Families and for Automatically Placing 



Records into Them 

Problem Statetnent 

When publishing the contents of a catalog, records often need to be organized 
into a more granular structure than that provided by the categories of the taxonomy. 
Moreover, this granularity often involves grouping records based on not only the 
category value but also other criteria (e.g. manufacturer). Families provide a way of 
identifying these groupings by fixing one or more field and/or attribute values. 
However, several problems exist in defining structures to efficiently store and retrieve 
these families of records. First, it must be possible to find all the records that belong to a 
particular family, so that when a family is published, the records belonging to that 
family can be easily identified. Second, a record should belong to only one base family 
so that there is a direct mapping from records to base families. This enables similar 
products to be easily found. Third, as new records are added to the partitioning table, 
they must be assigned to the proper base family. Finally, some or all of the records of a 
given family may need to be reassigned to a different family if they no longer have the 
criteria which define that family because the records are modified or the definition for 
the family is changed. 

For illustration purposes herein, a taxonomy is used wherein a table and its 
records may be partitioned into categories, with or without a hierarchy, where each 
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category comprises a set of common attributes. A category's attributes may not be not 
physically part of a record, the attributes are considered part of a definition of the 
record where the record contains a reference to the category. Examples will be based on 
the following taxonomy and data: 



Category ID 


Category 


Parent ID 


Position 


1 


Printers 


0 


0 


2 


Daisy Wheel Printers 


1 


0 


3 


Dot Matrix Printers 


1 




4 


Inkjet Printers 


1 


2 


5 


Laser Printers 


1 


3 



Attribute ID 


Attribute 


Type 


! 


Pages Per Minute (ppm) 


Numeric 


2 


Color 


Text 



Attribute ID 


Feature ID 


Feature 


2 


1 


Color 


2 


2 


Black & White 



Category ID 


Attribute ID 


1 


1 


1 


2 



The first four tables to the left define the 
following taxonomy: 

Printers {ppm, color) 
Daisy Wheel Printers 
Dot Matrix Printers 
Inkjet Printers 
Laser Printers 

The taxonomy provides an example 
of a category hierarchy with five 
categories, the root category being 
"Printers" and the remaining 
categories being child (and leaf 
node) categories of the "Printers" 
category. 



ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


1 


ALP1 


Acme 


5 


8 pages per minute; black & white 


$500 


2 


AUP1 


Acme 


4 


3 pages per minute ink; black & white 


$150 


3 


ALP2 


Acme 


5 


8 pages per minute; color 


$4000 


4 


ADMP1 


Acme 


3 


3 pages per minute; black & white 


$100 


5 


BLP1 


Best 


5 


20 pages per minute; color 


$5000 


6 


BLP2 


Best 


5 


20 pages per minute; black & white 


$1000 


7 


BUI 


Best 


4 


4 pages per minute; color 


$250 


8 


BDWPI 


Best 


2 


2 pages per minute; block & white 


$75 



The first table, or category table, defines categories within the taxonomy. The 
category table includes a "Parent ID" field that may be used to define a hierarchy 
and, more particularly, a category's level within a category hierarchy. The 



33 



WO 02/25500 PCT/US01/29486 
"Position" field identifies a position within a hierarchical level for a given 

category. Each of the records in a uniform fields table (i.e., the fifth table) 

references a category record in the category table that defines additional data 

elements (or attributes) of the referencing record. An attributes table (i.e., just 

below the category table) defines attributes that may be included in a category. 

The third table, a featurevalues table may be used to define enumerated values of 

an attribute of the attributes table. In the example, the featurevalues table 

identifies two enumerated values for the "color" attribute. The fourth table, or 

categoryattribute table, identifies the attributes that are associated with a record 

of the category table. Inheritance may be used to allow child categories to inherit 

attributes that are associated with a parent category. 

The families in the examples will be defined by the combination of 
manufacturer and category. 

Known Solutions and Their Shortcomings 

The "Table Per Family" Approach 

The "table per family" approach partitions the records into families by storing 
the records of each family in its own table. 



ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


1 


ALP1 


Acme 


5 


8 pages per minute; black & white 


$500 


3 


ALP2 


Acme 


5 


8 pages per minute; color 


$4000. 




ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


2 


AIJP1 


Acme 


4 


3 pages per minute ink; black & 
white 


$150 




ID 


Model 


Manufacturer 


Category ID 


Description 


Price 1 


4 


ADMP1 


Acme 


3 


3 pages per minute; black & white 


$100 
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ID 


Mode! 


Manufacturer 


Category ID 


Description 


Price 


5 


BLPl 


Best 


5 


20 pages per minute; color 


$5000 


6 


BLP2 


Best 


5 


20 pages per minute; black & 
white 


$1000 




ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


7 


B1J1 


Best 


4 


4 pages per minute; color 


$250 




ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


8 


BDWP1 


Best 


2 


2 pages per minute; black & white 


$75 



This approach provides for efficient storage of the data. However, as the number of 
families increases, so does the number of tables. Data management and searching for 
records then become increasingly complex and time-consuming because they require 
that many tables be accessed. 

Changes to the family definition require complex restructuring of the tables and 
reorganization of the records contained within them. For example, if the families were 
changed to be defined as the combination of the category and color attribute, then six 
new tables (Laser / Color, Laser / B&W, Inkjet / Color, Inkjet / B&W, Dot Matrix / 
B&W, and Daisy Wheel / B&W) would need to be created and populated, and the old 
tables would need to be destroyed. 

The 'Table Lookup" Approach 

The "table lookup" approach requires three steps. First, a table containing a 
record for each of the families must be created. Second, a lookup field for the family 
must be added to the partitioning table. Third, the id of the proper family record in the 
family table must be placed into this field for each record of the partitioning table to 
create the relationship between records and their corresponding family. 
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Family ID 


Description 


1 


Acme Laser Printers 


2 


Acme Inkjet Printers 


3 


Acme Dot Matrix Printers 


4 


Best Laser Printers 


5 


Best Inkjet Printers 


6 


Best Daisy Wheel Printers 



ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


Family ID 


1 


ALP1 


Acme 


5 


8 pages per minute; black & white 


$500 


1 


2 


AIJP1 


Acme 


4 


3 pages per minute ink; black & 
white 


$150 


2 


3 


ALP2 


Acme 


5 


8 pages per minute; color 


$4000 


1 


4 


ADMP1 


Acme 


3 


3 pages per minute; black & white 


$100 


3 


5 


BLP1 


Best 


5 


20 pages per minute; color 


$5000 


4 


6 


BLP2 


Best 


5 


20 pages per minute; black & 
white 


$1000 


4 


7 


BIJ1 


Best 


4 


4 pages per minute; color 


$250 


5 


8 


BDWP1 


Best 


2 


2 pages per minute; black & white 


$75 


6 



This approach has several major drawbacks. First, the manual process of assigning the 
family ids is time-consuming, error-prone and extremely tedious. Second, changes to 
the record do not result in the product being properly reassigned to the correct family. 
Third, changes to the families may require that some or all of the records of the family 
be reassigned. 

Alternative Solutions and Their Shortcomings 

The "Taxonomy" Approach 

Under this approach, the taxonomy (as defined in A2i's previous patent 
application) itself is extended in so that each family will become a leaf node in the 
taxonomy by building the fixed values for the fields and/or attributes defining the 
family into the category structure of the taxonomy. 
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Category ID 


Category 


Parent 

lis 


Position 


1 


A JULLJL 

Acme 


U 


n 
u 


2 


Acme Printers 


1 


n 
U 








1 


4 


Acme Inkjet Printers 


1 


2 


5 


Acme Laser Printers 


1 


3 


6 


Best 


0 


1 


7 


Best Printers 


6 


0 


8 


Best Daisy Wheel Printers 


6 


1 


9 


Best Inkjet Printers 


6 


2 


10 


Best Laser Printers 


6 


3 



The first four tables to the left define the 
following taxonomy: 

Acme 

Acme Printers (ppm, color) 
Acme Dot Matrix Printers 
Acme Inkjet Printers 
Acme Laser Printers 

Best 

Best Printers [ppm, color) 
Best Daisy Wheel Printers 
Best Inkjet Printers 
Best Laser Printers 



Attribute ID 


Attribute 


Type 


1 


Pages Per Minute (ppm) 


Numeric 


2 


Color 


Text 




Attribute ID 


Feature ID 


Feature 


2 


1 


Color 


2 


2 


Black & White 



Category ID 


Attribute 
ID 


1 


1 


1 


2 



ID 


Model 


Manufacturer 


Category ID 


Description 


Price 


1 


ALP1 


Acme 


5 


8 pages per minute; black & white 


$500 


2 


AIJP1 


Acme 


4 


3 pages per minute ink; black & 
white 


$150 


3 


ALP2 


Acme 


5 


8 pages per minute; color 


$4000 


4 


ADMP1 


Acme 


3 


3 pages per minute; black & white 


$100 


5 


BLP1 


Best 


10 


20 pages per minute; color 


$5000 


6 


BLP2 


Best 


10 


20 pages per minute; black & 
white 


$1000 


7 


BIJ1 


Best 


9 


4 pages per minute; color 


$250 


8 


BDWP1 


Best 


8 


2 pages per minute; black & white 


$75 



This has several shortcomings. First, the taxonomy structure will become polluted with 
information that has nothing to do with the original category- and attribute-based 
taxonomy. A single category of related database' records will be broken into multiple 
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categories, obscuring the actual relationship between the records and the original 
category. In this example the taxonomy is polluted with data about the manufacturers. 
Should new manufacturers be added or existing ones removed, then the taxonomy will 
require extensive changes. 

Second, there is a tradeoff between searchability and data redundancy: 
preserving the fields and attributes used to extend the taxonomy maintains 
searchability but results in duplicate data; eliminating them to avoid data redundancy 
restricts searchability. In this example there is data redundancy, because the printer 
attributes now have to be linked to two nodes (Acme Printers and Best Printers) instead 
of only one node (Printers in the base example). Suppose a new attribute is needed such 
as interface type. In the base example, the new attribute needs only be linked to one 
node, but in this example, it needs to be linked to two nodes. With more data, there will 
tend to be more data redundancy. Modifications will require changes to be made in 
multiple places, each increasing the risk of error being introduced into the data. 
Maintaining the taxonomy becomes increasingly complex since it ends up being 
extended with so much additional detail. Data redundancy can be eliminated from the 
taxonomy by choosing not to include the redundant data, but this is generally not an 
acceptable solution as it severely restricts the set of search parameters. 

Third, finding similar products becomes more difficult as they may no longer 
reside in the same category. For example, given product model ALP1, the category 
could have previously been looked up (Laser Printers), and a search could then be done 
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on that category to find similar products. Using this approach, the category lookup will 
find only Acme Laser Printers, but not Best Laser Printers. 

The "Stored Query" Approach 

Because the related records in a family have the same fixed values for a set of 
fields and/or attributes, they can be identified by a query specifying these common 
values. This query can be stored and later referenced to identify and locate the records 
for the family. 



Query Name 


Query 


Acme Laser Printers 


Monufacturer=Acme; Category=Laser Printers 


Acme Inkjet Printers 


Manufacturer=Acme; Category=Inkjet Printers 


Acme Dot Matrix Printers 


Manufacturer=Acme; Category=Dot Matrix Printers 


Best Laser Printers 


Manufacturer=Best; Category=Laser Printers 


Best Inkjet Printers 


Manufacturer=Best; Category=Inkjet Printers 


Best Daisy Wheel Printers 


Manufacturer=Best; Category=Daisy Wheel Printers 



This approach also has several shortcomings. First, there are a variety of problems 
setting up and maintaining the queries. Setting up the queries is time-consuming and 
error-prone, because each must be manually done. It requires that each query be given a 
name or id of some sort so that it can be referenced, and with a large number of families 
it quickly becomes difficult to organize and manage the set of family queries. There is 
no way to guarantee that the set of queries will contain the entire set of records, while 
also ensuring that each record belongs to exactly one query; that is, some queries may 
inadvertently overlap so that a single record belongs to multiple families, or the queries 
may not provide adequate coverage, so that some records may not belong to any family. 
The relationship between the families is not visually obvious from the queries, nor is 
there any single structure that identifies, illustrates, or maintains these relationships. 
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Finally, while the queries identify which records belong to the family, they fail to 
provide an efficient way to determine to which family a particular record belongs. 
Finding the family for a particular record would require examining each of the queries, 
one at a time, to see if the record matched the criteria for that query. 

Improved Solution 

The improved solution takes advantage of the fact that each family is defined by 
fixing a set of common values for one or more fields and/or attributes. The base 
families are organized into a partitioning hierarchy by partitioning the complete set of 
base families according to the common values that define each of them. Since a 
category- and attribute-based taxonomy already exists, it would be beneficial to layer 
the partitioning hierarchy on top of it, so as to leverage the work already done to create 
the taxonomy. This simply requires using the category field to define the first partition 
in the partitioning hierarchy. At first this might appear to be the same as the Taxonomy 
approach presented above. The difference lies in the fact that the partitioning hierarchy 
is layered on top of the existing taxonomy, rather than incorporate the family 
information directly into the taxonomy. The partitioning hierarchy is stored as a 
hierarchical structure (as explained in A2i's previous patent application). An additional 
table is used to store the fixed field and/or attribute that define the partitions. The table 
contains the following fields: the id of the partitioning node, the field or attribute which 
is being partitioned, and positional information to allow for combining and nesting 
partitions. The reason that an additional table is required, as opposed to storing the 
partitioning information directly as part of the hierarchy table, is that there may be 
multiple fields and/or attribute that define a partition. For example, a partition could be 
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defined based on the combination of a field (such as manufacturer) and an attribute 
(such as color). 



Family ID 


Family 


Parent 
ID 


Positio 
n 


1 


Printers 


0 


0 


2 


Daisy Wheel Printers 


1 


0 


3 


Best Daisy Wheel Printers 


2 


0 


4 


Dot Matrix Printers 


1 


1 


5 
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The first table to the left defines the following 
family partitioning hierarchy: 

Printers (ppm, color) 
Daisy Wheel Printers 

Best Daisy Wheel Printers 
Dot Matrix Printers 

Acme Dot Matrix Printers 
Inkjet Printers 

Acme Inkjet Printers 

Best Inkjet Printers 
Laser Printers 

Acme Laser Printers 

Best Laser Printers 



Family ID 


Field 


1 


Manufacturer 



Notice that the family partitioning hierarchy has the same initial structure of the 
taxonomy, but additional nodes are added to it. These nodes are created because a 
partitioning exists at the Printers node that is defined to partition by manufacturer. This 
causes all leaf nodes under this node to be further partitioned by manufacturer. The 
initial leaf nodes were Daisy Wheel Printers, Dot Matrix Printers, Inkjet Printers, and 
Laser Printers. Under each of these, additional nodes will be added for each 
manufacturer that has products defined by the query constructed by taking all the 
criteria defined by the ancestor nodes in the family partitioning hierarchy. Since this is 
the first partition, the criteria are simply the category for each of the initial leaf nodes. 
Notice that a node is not added for all manufacturers, only those that correspond to 
actual records in the database. 
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One important constraint on this approach is that whenever a new category is 
added or removed to the taxonomy, the corresponding portion of the family 
partitioning hierarchy must also be adjusted in the same manner. This will result in a 
change of base families. 



This idea can be extended to reflect changes in the possible values for other fields 
and attributes in the family partitioning hierarchy. Thus, when a value is 
added/removed from the set of possible values for a particular partition, the 
corresponding node will be added/removed from the family partitioning hierarchy. 
This is illustrated below where in addition to partitioning on the category (the initial 
taxonomy) and manufacturer (the additional nodes added to account for the 
manufacturer), a partition by Color is also performed on the Laser Printers node. 
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The first table to the left defines the following 
family partitioning hierarchy: 

Printers 

Daisy Wheel Printers 

Best Daisy Wheel Printers 
Dot Matrix Printers 

Acme Dot Matrix Printers 
Inkjet Printers 

Acme Inkjet Printers 
Best Inkjet Printers 
Laser Printers 

Acme Laser Printers 

Color Acme Laser Printers 
B&W Acme Laser Printers 
Best Laser Printers 

Color Best Laser Printers 
B&W Best Laser Printers 
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Notice that partitioning information for Color has been added to the Laser 
Printers node, and that only descendants of that node are affected. Also notice 
that a second occurrence of a manufacturer partition has been added. The reason 
for this is that descendant nodes inherit partition information. In other words, all 
descendant nodes of a particular "ancestor" node are automatically assigned the 
same partition information that is assigned to the ancestor, which makes setting 
up and maintaining partitions much more efficient. However, if there were no 
way to override the partition settings of an ancestor node, inheritance would 
always affect all descendant nodes. To get around this problem, inheritance does 
not affect a node that has any partitions defined nor does it affect any of its 
descendants; rather the descendants inherit the override partition settings. In 
order to get the partition defined for an ancestor as well as a custom partition, a 
node must define both partitions. If the second occurrence of the manufacturer 
partition had not been added, then the family partitioning hierarchy would be as 
follows: 
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The first table to the left defines the following family partitioning 
hierarchy: 

Printers 

Daisy Wheel Printers 

Best Daisy Wheel Printers 
Dot Matrix Printers 

Acme Dot Matrix Printers 



Inkjet Printers 

Acme Inkjet Printers 
Best Inkjet Printers 

Laser Printers 

Color Laser Printers 
B&W Laser Printers 



There is also a difference based on the ordering of the partitions. Had the Manufacturer 
Name partition been added after the Color partition, then the result would be as above 
with two nodes added under each of the Color Laser Printers and B&W Laser Printers 



nodes for the Acme and Best manufacturers. 



Partitioning by multi-valued fields and attributes is given special treatment to 
ensure that a record belongs to exactly one family. The combination of values is treated 
as a distinct unit when determining the unique set of values for the field /attribute. For 
example, if there was a partition on a multi-valued field of color and one of the records 
had Blue/Green as the value for that field, then the record would be placed in the 
Blue/Green family, and not in the Blue family or Green family. 

In order to find the records that belong to a particular family, a query can be 
constructed by setting constraints for each value from the fixed set of common values 
for that family. Executing that query will result in the set of records that belong to that 
family. 

Since the partitioning hierarchy is organized so that each branch, from a node to 
its sub-nodes, differ in the value or value combination on which the node is partitioned, 
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Thus the queries constructed for each of the base families will also differ by at least one 
constraint. The result is that each query is guaranteed to return a non-overlapping set of 
records. 

The linkage between families and records is accomplished automatically by 
constructing queries with the appropriate constraints for that family, as opposed to the 
manual process of linking each record to the proper family. This delivers the added 
benefits that when new records are added or existing records are modified, they will 
belong to the proper family automatically. Also, if the partitioning hierarchy is 
restructured so that family definitions change, each record of the partitioning table will 
automatically belong to its proper family. 
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Implementation and Method of Use 

See the discussion on families in the section entitled Catalog Manager Data Format. 
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Example 1 . The End Brushes category is partitioned by Manufacturer Name (inherited from 
ancestor node) 
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Example 2. A second partition is created which partitions by the combination of End 
Type and Brush Type 
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The Innovations 

Among the innovations are: (a) support for families and the partitioning 
hierarchy; (b) an efficient storage for families which allows products to be found from 
families, and conversely, families to be found from products; (c) layering the 
partitioning hierarchy on top of a category- and attribute-based taxonomy to leverage 
the existing taxonomy; (d) a method for automatically creating new families as the set of 
actual field and attribute values is changed; (e) a method for ensuring that product 
records automatically belong to the proper family, even as new records are added and 
existing records are modified; (f) the ability to partition at any level in the partitioning 
hierarchy, so that different nodes within a single partition can be partitioned differently, 
and (g) the inheritance and overriding of inheritance of partition information in the 
partitioning hierarchy. 
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Method for Automatically Maintaining Product Families 

Problem Statement 

After the family partitioning hierarchy has been created, it must then be 
maintained when changes are made to the taxonomy structure or to the domain of 
fields and attributes used for partitioning. Changes to the taxonomy structure that 
require updates to the partitioning hierarchy include adding, removing, moving, or 
modifying a category. Changes to the domain of a partitioning field include adding, 
removing or modifying a field value, while changes to the feature domain for a 
partitioning attribute include adding, removing or modifying a feature value. 

A second problem arises as a result of an optimization that avoids creating a 
family partitioning hierarchy that contains a high percentage of families with no 
records. In the previous section, we had assumed that the set of possible values and 
value combinations and the set of actual values and value combinations in existing main 
table records were identical. The optimization recognizes that this is not likely to be the 
case, and that in fact, the number of actual values and value combinations will be 
substantially less than the number of possible values and value combinations. 

Note that using the set of possible values and value combinations when creating 
families, the partitioning hierarchy becomes unnecessarily large because it will contain 
many families that contain no records. To illustrate this point, consider a catalog with 
200 categories, 500 manufacturers, and 10,000 products. If category were to be 
partitioned by manufacturer, the "cross-product" approach of using the possible value 
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combinations would create 100,000 families in the partitioning hierarchy, even though 
the main table contains only 10,000 product records! Most of these families would in 
fact contain no records, since for a particular category, only a small subset of 
manufacturers offers products (and conversely, each manufacturer offers just a small 
number of categories of products). 

By contrast, using only the set of actual value combinations that occur in the 
main table records reduces the number of families dramatically to precisely those 
containing records (and certainly no more than the number of products in the main 
table) and results in a much more compact partitioning hierarchy. A consequence of this 
optimization, however, is that the partitioning hierarchy must now be maintained not 
only across changes to the taxonomy structure and domains of partitioning fields and 
attributes, but also across changes to main table records. These changes include adding, 
removing, or modifying main table records. 

The Solution 

The solution is to automatically adjust the partitioning hierarchy when either the 
taxonomy structure, the domain of a partitioning field or attribute, or main table 
records are modified. 

Since the partitioning hierarchy is layered on top of the taxonomy, changes to the 
structure of the taxonomy hierarchy require updates to the partitioning hierarchy. In 
particular, nodes that are added, removed, modified, or moved in the taxonomy must 
be similarly added, removed, modified, or moved in the partitioning hierarchy. In 
addition, many of the advanced features for in place schema and data manipulation 
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such as splitting and merging attributes (discussed in the previous patent application) 
can also require updates to the partitioning hierarchy. 

Since the partitioning hierarchy depends on the existence of values in actual 
product records, changes to the main table records may require updates to the 
partitioning hierarchy. When records are added to the main table, new families need to 
be created if the records contain a value not yet used in any of the fields and/or 
attributes that are used in defining the family partitions. Similarly, if a record is deleted 
from the main table and that record is the only record in the main table to contain a 
particular value for one of the family partitioning fields or attributes, the corresponding 
partitioning node needs to be removed. Modification of a main table record can have 
effects similar to those of adding a new record or deleting an existing one since a new 
value assigned to a field or record could be a value not yet used in one of the family 
partitioning fields/attributes and the value replaced could have been the only 
occurrence of a particular value in the family partitioning field /attribute. The merging 
of attribute values in the taxonomy has the same effect as modifying the main table 
records by replacing the original attribute values with the merged attribute value. As 
such it can similarly require updates to the partitioning hierarchy. 

Note that updates to the partitioning hierarchy to reflect changes to the domain 
of a partitioning field or attribute are automatically handled through the handling of 
changes to the main table records. This is because changes to a domain no longer affect 
the partitioning hierarchy unless the added, removed or modified value is actually in 
use in the main table records. 
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Implementation and Method of Use 

See the discussion on families in the section entitled Catalog Manager Data 
Format. 

The Innovations 

Among the innovations are: (a) support for automatically maintaining product 
families and the partitioning hierarchy; (b) creating partitioning nodes based on the 
actual set of values and value combinations used in main table records rather than the 
possible set of values and value combinations; (c) detecting when the partitioning 
hierarchy needs to be updated due to modifications of the taxonomy or main table 
records; and (d) automatically maintaining the partitioning hierarchy when such 
changes are detected. 
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Structure for Maintaining Common Information for Families 
Problem Statement 

Very often, a database must store fields of common information that relate to a 

family of related records rather than just a single record. The challenge is to do so in a 

way that is efficient to store, easy-to-implement for existing data, and easy-to-maintain 

as additional records are added to the database. 

Known Solutions and Their Shortcomings 

The "Single Table" Approach 

The "single table" approach stores all of the data values for a main table record, 
including the common information that applies to an entire family of records, within 
the record itself in the single main table. As a result, the table structure is very simple, 
but at the same time, it is both wasteful of storage because the common data values are 
duplicated in multiple records, and wasteful of effort because each of the values must 
be entered manually and repetitively for each of the multiple records in a family. In 
addition, a change to any of the common data values is not automatically propagated to 
the entire family of records, rather; the data value must be updated in each of the 
multiple records that contain the value, introducing the potential for inconsistency and 
error. 

The "Multi-Table" Approach 

The "multi-table" approach is consistent with the relational data model and uses 
multiple tables to store related information. The primary table stores the specific 
information about each main table record while a lookup table contains a record for 
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each family that stores the fields of common information. Records in the tables are 
linked by placing in both tables an id that links each record in the primary table to the 
corresponding record in the lookup table. The advantage of this approach is that the 
common data values are stored only once in a single record in the lookup table, 
eliminating duplication and saving space; additionally, changes to the single copy of the 
common information are automatically reflected for all the records of a family. The 
drawback of this approach is that the link between each record in the primary table and 
corresponding record in the lookup table still needs to be defined manually; similarly, 
new records that are added to the database must be manually linked to the common 
information by the user rather than automatically linked by the system. In addition, if 
there are many different fields of common information, but only some of them are used 
for each family, the columns that store the information will be sparse. 

Improved Solution 

The improved solution maintains all of the benefits of the multi-table approach 
but eliminates the need for a lookup field in the primary table whose value identifies 
the id of the corresponding record in the lookup table, and simultaneously, the need for 
the user to manually place the id of the lookup record into this lookup field in each 
primary table record. Instead, the improved solution layers on top of the family 
partitioning hierarchy in such a way that the system creates and maintains all of the 
relationships automatically based on the membership of each group of primary table 
records in each family in the family hierarchy. 
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After the partitions have been defined by the user and the family partitioning 
hierarchy generated by the system, the next step is for the user to assign the common 
information for each family to the families corresponding to leaf nodes of the family 
hierarchy. Under this scheme, records in the primary table have already been grouped 
together into families, common information is then easily assigned to each family, and 
each new record in the primary table is automatically linked to the correct common 
information by virtue of its membership in the proper family. Moreover, for efficiency 
in storage, rather than store the data values in a fixed set of fields that exist for every 
family record, the data values are stored in a related, secondary table only on an as- 
needed basis so that, like attributes, they only take up space if they exist. 

Implementation and Method of Use 

See the discussion on families in the section entitled Catalog Manager Data 
Format. 

The Innovations 

Among the innovations are: (a) linking common information to families; (b) 
linking common information to each family rather than to the main table records by 
utilizing the family partitioning hierarchy; and (b) automatically creating and 
maintaining all of the relationships between existing and new main table records and 
common information based on family membership. 
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Structure for Automatically Formatting and Publishing Database Data 
Problem Statement 

Publishing information stored in an RDBMS as properly formatted presentations 

consisting of common information and tabular information for each group of related 

family records is relatively straightforward provided that the tabular layout format is 

relatively simple and the field structure comprising the information is uniform across 

the entire set of records in the DBMS. However, a problem arises when the tabular 

layout formats are more sophisticated, when each category of information consists of 

different data elements, and /or when each family requires its own distinct tabular 

layout format. In these cases, laying out each individual family requires more 

sophisticated algorithms and each family requires its own special handling, 

dramatically increasing the complexity of the publishing task. 

Known Solutions and Their Shortcomings 

Most RDBMS include report writers that provide an adequate platform for 
publishing information stored in the database as formatted presentations with simple 
tabular layout formats. The report writer can be easily coded to combine information 
from records in multiple tables, format the common information associated with each 
family in a structured way, and finally sort the records of tabular information. This 
approach works well with a relational database in which the field structure and the set 
of fields is consistent across the entire set of records, the field definitions are relatively 
static, and the number of fields is limited; because each field applies across the entire 
database, special handling and formatting for a particular field or fields is coded only 
once rather than multiple times. 
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By contrast, in a database with category-specific attributes, the field structure 
and set of fields differs for the records of each category. Thus, the report writer needs to 
be coded with specific intelligence about how to handle attributes on a category-by- 
category basis. If there are a large number of categories and attributes, this can be 
extremely tedious, time-consuming, and error prone to implement. And once 
implemented, the report writer must then be recoded each time changes are made to the 
taxonomy structure and/or the set of attributes associated with each category, which 
makes the report writer approach difficult to maintain as the taxonomy changes over 
time. 

Moreover, using existing report writers (or HTML) to present and structure 
tabular information more efficiently using pivot columns is a manual process that 
requires programming expertise that is beyond the capability of most average users. It 
is also data-dependent, so that even if the report writer can be used to create the pivot 
tables, the code then needs to be rewritten to reflect the data each time changes are 
made to the underlying records. Moreover, if each family requires a different tabular 
layout format because of category-specific attributes, then the particular tabular layout 
format for each family must be individually coded in the report writer, substantially 
increasing the coding complexity. 

Improved Solution 

The improved solution addresses and completely eliminates these coding 
challenges. It further extends the taxonomy structure to include not only the family 
partitioning hierarchy information but also to define additional layout information 
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about the structure and formatting of each presentation of family records. In particular, 
the taxonomy structure is extended for each family to include layout specifications that 
identify: (a) the fields and attributes on which to pivot the resulting sub-tables of 
records, reducing redundant information in each of the sub-tables; (b) the fields and 
attributes by which to sort the records in each sub-table; (c) the fields and attributes that 
should not be displayed in the published output; (d) the display sequence of the fields 
and attributes that have not been hidden nor used to pivot; and (e) pivot-specific sorting 
and display information to be applied on a pivot-by-pivot basis. This layout 
specification is performed and stored on a family-by-family basis so that not only fields 
but also category-specific attributes can be used to define the pivoting, sorting, display 
sequence, and other pivot-specific sorting and display characteristics for each family. 
Multiple pivots of the same type can be nested, while pivots of differing types can be 
combined. 

At the same time, the improved solution - for the first time - offers a WYSIWYG 
system that automatically generates and displays previews of the tabular layout formats 
based on the layout specifications without any report writer (or HTML) coding 
whatsoever. The records of a family are displayed in tabular form along with the 
participating fields and attributes. As each field or attribute is hidden or used to pivot 
or sort, and as each of the pivot-specific sorting and display characteristics are set, the 
corresponding table layouts for the family are automatically generated by the system 
and the preview display updated in real time, providing instant interactive feedback 
and allowing tweaking, tuning, and iterative refinement of the table layout of each 
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f amily, in sharp contrast to the manual approach that does not support this incremental, 
iterative layout process. 

The result is a system in which the report writer (or HTML) requires no complex 
code for pivoting tabular layout formats, no special coding for each category or family, 
and no intelligence about the underlying data. Instead, everything is driven by the 
extended taxonomy structure, and changes that occur in the taxonomy as well as the 
underlying records themselves are immediately reflected in the published output. In 
effect, the intelligence about how to lay out and format the records in each family is 
built into the taxonomy itself where it belongs rather than into special category- and 
family-specific programming code in the report writer. 

Another feature of this scheme is that, just as with the family partitioning 
hierarchy, the layout structure defined for each node is inherited by all nodes that are 
children of that node in the extended taxonomy. And once again, the inherited structure 
can be overridden on a node-by-node basis so that different children of a category can 
have different pivoting, sorting, display sequence, and other pivot-specific sorting and 
display characteristics. 

Specific pivot features and features for setting pivot-specific sorting and display 
characteristics include but are not limited to those listed in the table below: 



Feature 


Description 


Stack Pivots 


Add, remove, or inherit stack pivots for the current node 


Vertical Pivots 


Add, remove, or inherit vertical pivots for the current node 


Horizontal Pivots 


Add, remove, or inherit horizontal pivots for the current node 


Sorts 


Add, remove, or inherit sorts for the current node 


Hidden 


Add, remove, or inherit hidden fields and/or attributes for the current node 


Display Order 


Set, reset, or inherit the display order for the current node 


Column Names 


Set, reset, or inherit the display name of the Fields and attributes for the current node 
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Pivot Names 


Show or hide the name of the pivot column in the tabular layout format 


In Stack Pivot Header 


Display the stack pivot name in the stack pivot header 


In Vertical Pivot Column Header 


Display the vertical pivot name in the vertical pivot column header 


In Horizontal Pivot Row Header 1 


Display the horizontal pivot name in the horizontal pivot row header 


Span Above Horizontal Pivot Values 1 


Display the horizontal pivot name in a spanning row above the horizontal pivot values 


With Horizontal Pivot Values 1 


Display the horizontal pivot name in each cell with the horizontal pivot values 


Pivot Values 


Display combined pivot values as a single line or as multiple lines 


Column Titles at Top 2 


Display column titles once at the top of a stack pivot table 


Column Titles Interleaved 2 


Display column titles interleaved within a stack pivot table before each sub-table 


Separate Stack Pivot Tables 2 


Maintain each stack pivot sub-table as a separate table 


Sort Ascending 


Sort the column or pivot in ascending order 


Sort Descending 


Sort the column or pivot in descending order 


Sort in Natural Order 


Sort the column or pivot in natural order (text values only) 


Units 


Show or hide the unit of measure associated with each numeric value 


Merge Like Cells in Column 


Merge cells of the column with the same value into a single cell 


Promote Column if Same 


Display the column value with the common information if all cells have the same value 


Demote Column to Footnotes 


Display the column values as footnotes 


Hide Column if Empty 


Hide the column if all the cells are empty 



Horizontal pivots have three variants. 
Stack pivots have three variants. 
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Implementation and Method of Use 

See the discussion on families in the section entitled Catalog Manager Data Format. 
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Example 1. The Weiler End Brushes family before pivoting 

Example 2. The Weiler End Brushes family after stack, vertical, and horizontal pivoting 



*Cat Clicnl • AugasPubDcj 



FanJfHia.atdy V." 

rE'<)^rav.R *> 
*3'OAwa3!!,+> , w . 

£ OMrfwatwra • 
; 0<- n-bet Bo 



is OMa'* V ; 
li 4>v?tf« frr'rtl^ 





JhmltMfUJitM 




Utf[Uax| 


UoU 


HnSizatUoI 




J Mm 




2335 1«4 


UXCb 


$122 


2335 14GS 


0008b 


SU1 


tlttm 


2tttt 


2335 lO 


COOBn 


SSJ0 


2335 H78 


OJCb 


SSJ1 


22HM 


urn 


2X361472 


OJWfe 


JILfl 


2335 M74 


0122b 


San 


3m 


mm 


23351*75 


0O6n 


SUM 


2335 1471 


tuxeb 


S12J5 


23351460 


OJBfa 


S10J5 



Jl*.-V ,'toa 
tVJv jO-.V 

tC>* Six* [K*i] 



Prical ^j; 



60 



WO 02/25500 PCT/US01/29486 

The Innovations 

Among the innovations are: (a) extending the taxonomy structure to support 
automatic, formatted layout and publishing of presentations coupled with WYSIWYG 
generation and preview display of the resulting table; (b) abstracting and generalizing 
the concepts of stack, vertical, and horizontal pivots to support a wide variety of 
different table layouts; (c) abstracting and supporting the different variants of the stack 
and horizontal pivots; (d) extending the taxonomy with layout information that 
identifies fields and attributes on which to pivot and sort, which ones to hide, the 
display sequence of the remaining fields and attributes, and additional pivot-specific 
sorting and display characteristics; (e) allowing multiple layout items to be either 
combined or nested; (f) allowing cells with the same value to be automatically merged, 
columns with all the same value to be promoted, and columns that are empty to be 
hidden; (g) providing a WYSIWYG system that automatically generates and displays 
pivoted tabular layout formats in real time and allows the layout process to be 
interactive, incremental, and iterative; (h) the ability to specify pivot information at any 
level in the partitioning hierarchy, so that different nodes within a single partition can 
be laid out differently, and (i) the inheritance and overriding of inheritance of layout 
information in the partitioning hierarchy. 
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Media-Independent Publishing 
Problem Statement 

Publishing catalogs of product information to paper and to electronic media 

historically have been two very different and distinct processes, with a very different 

level and type of effort involved, and very different standards and expectations for 

quality. The challenge is to eliminate the distinctions between paper and electronic 

output and combine the best of both media in a way that brings to electronic catalogs 

the structure and high standard of quality typical of paper catalogs, and at the same 

time, dramatically reduces the cost of laying out paper catalogs by flexibly, 

prpgrammatically, and automatically generating page layouts in real time. 

Known Solutions and Their Shortcomings 

Paper catalogs are meticulously laid out with existing page layout programs a 
page at a time and formatted a table at a time by manually populating page layouts 
with product data, a process that is time-consuming, tedious and very, very expensive. 
There is also no easy way to experiment with different tabular layout formats and views 
of the data, and once a page has been laid out, it is difficult to add or remove records 
from the tables without destroying the structure of the page and requiring that it be laid 
out again (sometimes from scratch) which discourages updates and means that catalog 
pages tend to quickly become out-of-date. The upside of this complex process, however, 
is that manual page layout usually results in high page density, flexible and well- 
structured tabular layout formats using pivots to eliminate redundant information, and 
a very high overall standard of quality. Notwithstanding the high level of quality, 
however, it remains difficult to enforce a uniform look throughout a publication 
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because more than one person is usually involved in the page layout process, and each 
lays out pages somewhat differently. 

By contrast, electronic catalog pages are typically database-driven and generated 
programmatically in real-time. Since page layouts do not actually exist until the 
electronic catalog page is displayed, new products can be added and old products 
removed without disturbing the system or the published output. Unfortunately, the 
downside of this flexibility is that automatically generated electronic catalog pages are 
usually no more than wide, ugly, "spreadsheet-style" tables of data with redundant 
information, very little structure, and none of the sophisticated tabular layout formats 
that are standard for paper pages. With category-specific attributes and a large number 
of categories, it is even more impractical to have a customized hand-coded display for 
each family, so generic unstructured presentations are even more the norm. 

Moreover, when publishing to multiple media, none of the effort invested in 
meticulously laying out paper pages can be leveraged for the electronic catalog, since 
both the structure of the tabular layout formats as well as the product data are typically 
trapped within the page layout itself, while the electronic catalog requires that the data 
be stored and managed in a database to be searchable and generated in real-time. Thus 
the worlds of the two media are completely distinct and non-overlapping, very difficult 
to integrate, and require two distinct publishing efforts. 

Improved Solution 

The improved solution layers on top of the structure for automatically formatting 
and publishing database data described in the previous section, in which all of the 
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tabular layout formats that are typically stored in the page layout are now instead 
captured and stored in the database alongside the product data itself. In this scheme, 
the searchable, database-driven electronic catalog can not only serve up the product 
data but also the formatting data, preprocessed as to the structure of the pivoted, 
tabular layout formats, to be rendered in real-time by the report writer (such as ASP- 
generated HTML), in a data-independent fashion, even on a catalog with many 
categories and category-specific attributes. The report writer code itself (or HTML) need 
only handle the preprocessed pivot tables and requires no complex code for pivoting 
tabular layout formats, no special coding for each category or family, and no 
intelligence about the underlying data. Using the structure described in the previous 
section, electronic catalogs for the first time can now have the density and layout 
quality of paper catalog pages while maintaining their database-driven searchability. 

At the same time, the improved solution substantially eliminates the manual 
page layout process for paper catalogs. All of the time and effort invested in defining 
the appropriate tabular layout formats for each family now can be immediately 
leveraged for paper catalog publishing. Instead of requiring that the user manually 
populate page layouts with product data, the system automatically generates the page 
layouts by combining product data and formatting data from the database and then 
using the page layout program's API (for programs such as QuarkXPress or Adobe 
InDesign) or intermediate ASCII file format (for programs such as Xyvision XPP) to 
render pages automatically. This dramatically reduces the cost of laying out paper 
catalog pages, allows changes to the product data to be reflected immediately in 
subsequently generated output, supports the on-demand generation of custom catalogs 
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on product subsets with no additional effort, and results in a more uniform look 
throughout the publication since every page is being generated dynamically and 
automatically by the system. 

Each paper publication starts out as a snapshot of the family partitioning 
hierarchy and its associated formatting information. Any of the formatting 
specifications defined and stored in the family partitioning hierarchy and used for 
electronic catalog publishing can then be changed in any way for each paper 
publication, providing almost unlimited flexibility to create custom paper catalogs each 
based upon the electronic standard but laid out in a fashion that is as similar to or as 
different from any other catalog as necessary. In addition, the system offers the 
following per-publication flexibility: 

• A product mask can be applied when the snapshot is taken to limit the set of 
products appearing in the paper publication, so that each publication can have a 
different, custom subset of the entire product set (masks can also be applied 
electronically, and/or search parameters specified, to limit the set of products 
appearing in electronic output). 

• The order of the partitions in a publication can be rearranged when the snapshot is 
taken and set in any order (by contrast, partitioning order is fixed in the family 
partitioning hierarchy). 

• The sequence of the families in a publication can be rearranged in any order (by 
contrast, the family sequence is fixed in the family partitioning hierarchy). 

• A family can be copied from the family partitioning hierarchy into the publication to 
include families that were not initiallv included in the oublication. 
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• Each family can appear in multiple locations in the publication and each can be 
individually formatted, can include a different subset of the columns and common 
information, and can contain a different subset of the records in the family (by 
contrast, each family in the family partitioning hierarchy can appear only once, 
contains a fixed subset of the columns and common information, and contains all of 
the records). 

Additional features for paper publishing that allow publication-specific restructuring 
and reformatting of each family as well as the entire publication are listed in the table 
below: 



Feature 


Description 


Layout Detail 


Change any or all of the tabular layout format settings of the current node 


Column Names 


Change any of the display names for the current node 


Records 


Exclude any of the records of the current node, or include any that had been masked out 


Family Data 


Exclude any common information of the current node 


Refresh Options 


Include or exclude new records, columns, or common information 


Detail 


Display the criteria for the current node 


Format 


Specify additional formatting options 



The Innovations 

Among the innovations are: (a) layering both the electronic and paper publishing 
process on top of the same extended taxonomy structure for automatically formatting 
and publishing database data; (b) using tabular layout formats that are captured and 
stored in the database alongside the product data itself rather than stored in the page 
layout; (c) publishing high-quality output to the web using this layout information 
stored in the database; (d) using the page layout program's API or intermediate ASCII 
file format to render pages automatically; (e) allowing a product mask to be applied 
when the publication is first created; and (f) allowing the layout detail, the column 
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names, the set of records, and the common information to be individually customized 
for each family of a particular publication. 
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Catalog Manager Data Format 

Overview 

This document describes the current internal or low level database organization or 
schema for A2i Catalog databases. As such, it changes as a reflection of the growth or 
evolution of A2i products. The Catalog Manager Data Format (CMDF) document is 
confidential and proprietary to A2i, Inc. 

Databases 

On a given A2i Database Server a global database contains a list of all A2i Catalogs on 
that machine. The global database is always named A2i_xCat_DBs. Within it is a table 
that holds the logical or publicly known names of catalogs and the actual database 
names used for storage. 

Three databases are used to represent each catalog. 

• Base database that holds everything but image or Large Object data. 

• Originals database that holds the bitmap data for the original imported images. 

• Thumbnails database that holds the scaled down 200x200 bitmap data of the original 
imported images. 

On Oracle servers, there needs to be a sequence called A2IJSEQUENCE starting at 1 
incrementing by 1 for each of the 3 databases. 



Catalog Table 

There is a single table called 
_A2i_CM_Servers_ 



SQL field 
name 


SQL Field Type 


Description 4 


CatalogName 


Varchar 128, not 
NULL 


Logically or publicly known Name of an A2i 
Catalog. 


MainDB 


Varchar 30, not NULL 


Name of the database for most non-binary 
data 


OrigDB 


Varchar 30, not NULL 


Name of the database for original binary data 


ThumbDB 


Varchar 30, not NULL 


Name of the database for scaled down image 
data 


VariantDB 


Varchar 30, not NULL 


Name of the database for image variant data 


Datel 


Date, NULL 


Date/Time field for future or miscellaneous 
use 


Description 


Varchar 255, NULL 


Miscellaneous use 
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Create a Unique valued index on CatalogName 

Each A2i Database Server may differ from other DB Servers. Any parameters or settings which 
are modified for an individual DB Server are maintained in the A2i_xCat_DBs database in a 
settings table. 



Settings Table 

There is a single table called 

_A2i_CM_Settings_EiTor! Reference source not found. 



SQL field 


SQL Field Type 


Description 


name 






Name 


Varchar 128 


The name of the parameter. 


Setting 


Varchar 128 


The value the parameter is to take. 



Create a Unique valued index on Name 



At the present time there are two settings: 

DataPath The directory location where DB data files are to be created. 

BackupPath The directory location where backup files are to be created. 

Each Catalog has a table with a single record that is used to hold for state information 



Server Table 

There is a single table called 

_A2i_Server_ that requires exactly 1 record. 



SQL field name 


SQL Field Type 


Description 


ServerName 


Varchar 50, not 
NULL, default 
empty string 


Name of xCat Server (XCS.exe) that is 
currently using this SQL database. The 
XCS.exe program fills this in. 


StartupTime 


Datetime, not NULL, 
default any time 


time the current server connected to this SQL 
database 


LastCheckln 


Datetime, not NULL, 
default any time 


last checkin time, the current server checks 
in every minute. 


FamilyCatFieldld 


Int 4, not NULL, 
Default 0 


main table field Id of the fields used as the 
base field in the family table. If no family 
table exists, this will be 0. 


DBSchemaVersio 
n 


Int 4, not NULL, 
default 0 


Revision number of the database schema or 
structure. High order short integer is major 
version, low order short integer is minor 
version. XCS uses this to determine if it must 
upgrade the database structure. 



69 



WO 02/25500 



PCT/US01/29486 



LockVersion 



Int 4, NULL 



Used by administrative console program to 
lock database for structure changes 



Tables Table 

This table contains the descriptions of all Primary Data Tables. Primary Tables have the 
name _A2i_x_ where x is a number starting at 1. The Primary Tables table has the 
following name: 

_A2i_CM_Tables_ 

Note: Every entry in this table represents a Primary Table in the database. There is no 
need for a null entry. 



The Field Structure is as follows: 



SQL field 
name 


SQL Field 
Type 


Description 


Tableld 


Int 4, 

not NULL, 
Primary Key 


Id defining the table. 

First valid Id is 1. Unique. 

Actual table name is _A2i_x_ where x is the Id. 


TableName 


Varchar 50, 
not NULL 


User readable name for the table. Not null or empty. 
Must be unique when converted to upper case and all 
whitespace is removed. 


TableType 


Int 4, not 
NULL 


Type of table, valid values are: 
0,1,2,4,11 

Refer to the Table Type Schedule for a list of TableType 
values and a description of each. 


Lookup 


Varchar 50, 
not NULL 


This is a text version of the field Id in this table that 
specifies which field represents the entire record. This 
field will replace the Id references in the linking table. 
So if the main table field indicates id's 3,4,5 the Lookup 
field from records with Id's 3,4,5 will be displayed in 
place of the numbers. 

The format is "Bid, so if the field Id of the lookup field is 
100 the value of this field will be F100 


Params 


Varchar 255, 

NULL 

allowed 


MainTable : Id of associated MaskTable 
MaskTable : Id of associated MainTable 
HierAttrTable: Id of image object table 


Attributelmag 
eTableld 


Int 4, not 
null, default 
0 


Image Lob Data Table Id associated with this category 
table the images for its attributes. 


FVImageTabl 
eld 


Int 4, not 
null, default 
0 


Image Lob Data Table Id associated with this category 
table the images for its feature values. 


NextAutoId 


Int 4, not 
null, default 


Tracks the next available Id field for the Autold field 
type (not yet implemented, but necessary in the 
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1 1 structure) 

Create a Primary, Unique Valued, Clustered Index on Tableld 
NOTE: the Views field has been removed. 



Table Type Schedule 



TableTyp'e 
• Value ■ 


Primary 
or Lob , 


•'■''[. . . . TableType Description- 


0 


P 


MainTable 


1 


p 


FlatTable, essentially the same as MainTable except it cannot 
currently have an associated Mask Table. 


2 


P 


HierTable, Like Flat table, but every record has a parent record. The 
Id of the top-level node is 0. This table type is usually displayed in a 
tree format. 


4 


P 


HierAttrTable, Hierarchy table having associated attributes. Each 
record in this table can be linked to 0 or more attributes. Children 
inherit their parent's links. Main Table records can be linked to leaf 
nodes in this table. 


I '5 


L 


TextDataTableType, 


6 


L 


ImageDataTableType 


7 


L 


SoundDataTableType 


8 


L 


VideoDataTableType 


10 


L 


ExtDataTableType 


11 


P 


MaskTable. This looks like a hierarchy table, except it always has a 
Mask field which contains a BitVector specifying which records in its 
linked MainTable apply to its own records. 


18 


L 


PDFDocumentTable 



Table Type Schedule 



71 



WO 02/25500 



PCTVUS01/29486 



Field Type Schedule 



Field. 
Type 
Value 


. : Held T^e. : Description ; ■- 

i ' "'>:■::•"': • "• '•. '";*." 


, V .;E)BM3 : Mapping 


SQL 
Server 


Oracle' ; 


0 


Integer Field, default value is NULL 


Int4 


Number 
(10) 


1 


Real4Field, allow NULLs, default value is NULL 


Real 4 


Float 


2 


CurrencyField, allow NULLs, default value is NULL 


Real 4 


Float 


3 


DateField, allow NULLs, default value is NULL 


Datetim 
e 


Date 


4 


TimeField, allow NULLs, default value is NULL 


Datetim 
e 


Date 


5 


BoolField, allow NULLs, default value is NULL 


Tinyint 


Number (3) 


6 


FixedWidthText, not NULL, default value is empty string 


Char 


Char 


7 


FlatSubTableField, holds Id of record in a separate FlatTable. 
The Lookup field for the record with the specified Id will be 
displayed. Used to allow the field to be parametrically 
searched upon. Not NULL, default value is 0 


Int4 


Number 
(10) 


8 


HierSubTableField, same as FlatSubTableField, but links 
toHierTable. 


Int4 


Number 
(10) 


9 


INVALID, previously FlatAttrSubTableField 






10 


Hier AttrSubTableField, same as HierSubTableField but 
with attributes. Not NULL, default value is 0 


Int4 


Number 
(10) 


11 


FlatMultiSubTableField, this indicates this field will 
contain 0 or more links to a FlatTable. Being a virtual 
field, there will not be an actual field in the _A2i_x_ 
table. Values are stored in a separate table. Needed for 
data normalization and SQL query generation. SQL, no 
actual field. 







12 


TextField, contains Id of single large text block, not 
NULL, default 0 


Int4 


Number 

do) ! 


13 


MultiTextField, contains 0 or more ids of large text 
blocks. This is a virtual held in that there will not be an 
actual field in the _A2i_x_ file. The values will be 
stored in a separate table _A2i_x_f_ where x is the value 
of this table Id and f is the value of this field id. 


Int4 


XT 1 

Number 
(10) 


14 


ImageField. This contains the Id of a single image, not 
NULL, default 0 


Int4 


Number 
(10) 
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15 


MultilmageField. This indicates this field will contain 0 
or more image ids. This is a virtual field in that there 
will not be an actual neia in tne _/\zi_x_ nie. 1 ne 
values win oe storeu m a &eparcue tauic _-r%^_A_i_ 

T ..L^ n v :„ f"U 0 trail ia of fViic fnVilp TH ar»H f iq fbp valup of 

wnere x is une varue ui uuo iciuie ±u aim i io mc »cuuc w 
tnis neia la. 


Int 4 


Number 
(10) 


lo 


bOUnarieia, INOt lei lmpicuiciutru ^in ixj 






1/ 


Muiubounaneia, in n 






1 Q 

lo 


viaeorieia, in ii 








Muiu viaeorieia, in 11 






on 


Keservea lor jruuire udc 






21 


Reserved for Future Use 






22 


ExtField,NYI 






23 


MultiExtField, NYI 






24 


Name field, lext string containing text ana coaes to 
represent Title, First, Middle, Last, suffix for name. 
Format to be determined. SQL Varchar, not NULL, 
default empty string 


V dlU la. 

r 


V (XX V-l l«i 


25 


MeasurementField used when a number is not 
descriptive enough. Examples are length or 
temperature. This will generate 2 fields in the _A2i_x_ 
table, rx and Ux. rx is tne actual vaiue as a reai <± anow 
NULLs; Ux is the unit of measure int 4 allow NULLs, 


real 4 

0 r% A 

int 4 


Float 

o-n H 

Number 


26 ! 


TimeStampField, used when both date and time are 
needed. Allow NULLs, default value is NULL 


Datetim 
e 


Date 


27 


NormalizedTextField, special type of fixed width text 
field that sorts, and searches based on the normalized 
version of the string it contains. Normalization 
currently removes all non alpha-numeric characters. 
NOTE, the value it this held is tne actual string vaiue 
containing non-normalized characters, i.e. "l2-34.b/56", 
however it sorts ana searcnes as if me string were 
"1234b56" / not NULL, default empty string 


Varcha 
r 


Varchar2 


28 


RealoField, o byte floating point number, anow in ull,s, 
default is JNULL 


l\CuX O 


Float 

x lis a. 1 


29 


rlierJVlultiDUD 1 aDieJrieia, same as riauviuitiDuu 1 auicrieiu 

^11 J DUt trie LaUlc ll lUU\a IU 1& d iuciaiLJ.Ly lauic 








internal type, not oiiowcu as anuai nciu. 






31 


MultiTemplateField, NYI 










PDFDocumentField. NYI 






33 


MultiPDFDocumentField, NYI 






34 


AutoIdField, not NULL, Create a Unique index on this 
field 


Int 4 


Number 
(10) 
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35 


LargeTextField, allow NULL (Oracle: default 
Empty_Clob()) 


Text 


CLob 


36 


LogField, allow NULL (Oracle: default Empty_Clob()) 


Text 


CLob~ 


37 


Multi Measurement Field. This indicates this field will 
contain 0 or more Measurements(V alue, Units). This is 
a virtual field m that there will not be an actual field in 
the _A2i_x_ file. The values will be stored in a separate 
table _A2i_x_f_ where x is the value of this table Id and 
f is the value of this field id. 







Field Type Schedule 
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Fields Table: 

Describes every non-Id field in all the primary tables in the database. Id fields are 
assumed to always exist in every primary table and have the field name Id. They are 
not included in this table. The fields table has the following name 

_A2i_CM_Fields_ 
All entries refer to fields; there is no need for a null entry. 

Note, the Description of FieldType specifies the SQL field types for the primary tables. 
The Field Structure is as follows 



SQL field 
name 


SQL Field 
Type 


Description 


Permanentld 


Lit 4, not 
NULL, 
Identity (1, 
1) 


Ever increasing Id used to make sure newly added records 
are not confused with previous records that had the same 
id. Oracle does not need the Identity(l,l) descriptor. 


Fieldld 


Int4, 

not NULL, 

Primary 

Key 


Id defining the field. First valid Id is 1. Unique. Actual 
table field names are Fx where x is the Id. Exceptions: 
Field type 11 has no physical field in the _A2i_x_ table 
Field type 25 has an additional Ux field to spedfy type of 
units. 


Tableld 


Int4, 

not NULL 


Table Id to which this field belongs 


FieldName 


Varchar 128 


User readable name for the field. Not null or empty. 
Must be unique among all FieldNames for a given Tableld 
when converted to upper case and all whitespace is 
removed. 


FieldType 


Int4, 

not NULL 


Type of field, valid values are 0 to 36 

Refer to the Field Type Schedule for a list of FieldType 

values and a description of each. 


FieldParams 


Varchar 255 
not NULL, 
default "" 


This provides extra information for some of the above field 
types. If any of these fields require spaces after the last non- 
space character, you may enclose this field in "" marks. 
These double quotation marks are stripped off when this 
field is read in to the xCat server. The field types and their 
format are: 
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2 - (CurrencyField), the number of decimal places, whether 

to allow fractions (0, 1) and the actual currency 

symbol(s) to preceed the currency amount, examples: 

2 / 0/ , eur " — (2 decimal, no fractions) 
3,1,"$" — (3 decimal, allow fractions) 


5 - Boolean Field 

{True String} 1 {False String} 1 T or F 

Example, Red 1 Blue 1 T means the True String is Red, 

False String is Blue and the default is the True String 

(Red). 


6 , 27 (FixedWidthText, NormalizedText)- number of 
characters 


7,8,10 - _A2i_x_ primary tableld that this field links to. 
Optionally followed by comma and the default value for 
new records. Tableld[, default value]. If no value is 
specified or the value is not a valid record, the default 
value is 0 


11,29 - _A2i_x_ primary tableld that the id's in this field 
refer to. Being multi-valued fields, the default value is 
always none. 


12,14,16,18,22,32 - _A2i_data_x_ object data table Id that this 
field refers to. 


13,15,17,19,23,31,33 - _A2i_data_x_ object data table Id that 
this field refers to. 


25, 37 - decimal places, allow fractions, Measurement Type 

Id, Defaults Units of Measure Id. E.G. 

3,1,1,1 — 2 decimal places, allow fractions, meas id 1, units 

idl 

-1,0,0,0 — max float decimal places, no fractions, meas 0, id 
0 


3,4,26 DateField, TimeField, and TimeStampField. To use 
the current time as the default value put a T here, otherwise 
leave it blank or put an F here. Valid values are T, F or 
nothing at all. 


Position 


Int 4, not 
NULL 


Holds the default display position for fields within a table. 
Beginning with 0, each position is unique per Tableld. Each 
user can override the default at his workstation. 


UselnKeywor 
d 


Bit 1, not 
NULL, 
default 0 


Determines whether it can be used in keyword searches 
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Create a Primary, Unique Valued, Clustered Index on Fieldld 
Create a Unique Valued Index on Tableld, Position 
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Direct Data Tables 

Primary Tables 

These tables contain all non-attribute-related information. 

These tables are named: 
_A2i_x_ 

where x is the Id specified in the Tableld field of _A2i_CM_Tables_. In typical usage, 
when there are multiple tables having this form, one table is considered the main table 
with the remainder acting as sub tables used for multi-values, etc. However there is no 
theoretical limit to the number of main tables. 



Every primary table has the following fields 



SQL field name 


SQL Field Type 


Description 


Id 


Int 4, not NULL, Primary 


Id of the record. Valid Id's start 




Key 


at 1 for new records 


Permanentld 


Int 4, not NULL, Identity (1, 
1) 


Ever increasing Id used to make 
sure newly added records are 
not confused with previous 
records that had the same id. 
Oracle does not need the 
Identity(l,l) descriptor. 



Create a Primary, unique valued, clustered index on Id 



Every primary table has a permanent NULL record with Id = 0 and all fields set to the 
default value for that field type. See the description of the _A2i_CMJFields_ table for 
the default values. When creating a database, make sure all _A2i_x_ tables have this 
NULL record. 

This NULL record is needed because any table can be a lookup for another table. On 
initial record creation for a table, all fields must contain valid values. This means all 
lookup fields must link to an actual record in another table. By default they link to this 
empty record. This maintains a valid database even if the lookup fields are not 
changed. 

Other fields have names Fx where x is the Fieldld specified in the _A2i_CMJFields_ 
table. The types of these fields are specified in the _A2i_CM_Fields_ table. We have 
several reasons to use field names Fx instead of more human friendly names like 'Color 
field'. 

1. Performance. We only need to know the Id of a field to access it. This results in less 
storage in the server and client components and small network packets. It also 
speeds up the search for a particular field. 
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2. Cross Database independence. This format is valid for SQL databases, Codebase, 
MS Access or any other standard database system. We use each database simply as 
a container. By restricting the field names, we guarantee that all names will comply 
with naming conventions on the various database systems used. 

Some exceptions to this general naming of fields Vx are as follows: 

1. Any multi-valued fields, Field type 11 (FlatMultiSubTableField) and 29 
(HierMultiSubTableField) and object data fields 13,15,17,19,23,31,32,33 do not have 
physical fields in the _A2i_;t_ table 

2. Field type 25 (MeasurementField) has an additional field named Ux (where x is the 
Fieldld) used to specify the type of units used. 
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Mask Tables: 

A Mask table is a special type of primary hierarchical table with an additional field 
called Mask. Following the same rules, it is named: _A2i_*_ 
This additional Mask field stores the bits of a BitVector to track record Ids in another 
linked table. It is like a sub-table in that each of the records in this table correspond to 
multiple records in a linked main table, however the link is stored in this table as a 
mask instead of in the main table as a category field. For example, a record in the mask 
table with the mask having bits 1,2 and 10 set means that this record corresponds to 
records in its linked main table with record ids 1, 2 and 10. 

In the _A2i_CM JTables_ table, the mask table entry has a type of 

11 = MaskTable 
and the Params field is the table Id of the linked main table. 
The main table has its Params field set to the Id of the mask table 



Similar to other primary tables, every mask table has a standard Id field and also has 
any fields specified for it in the _A2i_CM JFields_ table. The additional Mask field, 
described below, differentiates it from other primary tables. 



SQL field name 


SQL Field Type 


Description 


Mask 


Image 16, not NULL, default 


Bit field of the corresponding 




value (0x00) 


record ids in the linked main 






table 
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Hierarchy Tables: 

A Table of type HierTable and HierAttrTable table relies on an additional table to 

describe the hierarchy relationship. The table is named: 

_A2i_H_*_ 

where x is the Tableld of the HierTable or HierAttrTable 



The structure of this table follows: 



SQL field 
name 


SQL Field Type 


Description 


Id 


mt 4, not NULL, 
Primary Key 


Id of an existing Id in the _A2i_x_ HierTable or 
HierAttrTable, where x is the same value as in 
this tables name 


Parentld 


Int4,notNULL 


Parent Id. Must specify an existing Id in the 
A2i_x_ table, where x matches. 


Position 


Int4,notNULL 


Position of this node under its parent. First 
position is 0. No missing positions are allowed, 
it must be 0,1,2,3 and so on. If a node is 
removed all children after it must have their 
positions decreased by 1. If a child is inserted, 
all nodes after it must have position increased 
by 1. 


ShowChildren 


Bit, not NULL 


Determines whether the descendents of this 
node should be displayed in the search lists, 
and replaced with this node's name in the 
result list. Set to 1 means children are shown, 
set to 0 means children Eire hidden and 
replaced 


OriginaUd 


Int4,notNULL,use0 
as default to convert 
existing databases 


Id of original record that this node is an alias 
of. OriginaUd = 0 means this is an original 
node. 



Create a Primary Key, Unique Valued, Clustered Index on Id. 
Create a Unique Index on Parentld + Position. 



This table should contain a master parent record with: 
ld = 0 

Parentld = -1 
Position = -1 
ShowChildren = 1 
OriginaUd = 0 

All top-level nodes will then use this record as their parent. 
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Multi Value Tables; 

Fields with type 11 (HatMultiSubTableField), 29 (HierMultiSubTableField), object data 
fields 13,15,17,19,23,31,32,33 and MultiMeasurementField 37 do not have physical fields 
in their data table. The lookup Ids are stored a separate multi-value table. 

The multi-value tables are named 
_A2\_xJ_ 

where x is the Tableld of the table containing a multi-valued field, 
/is the Fieldld of the field. 



The structure of this table follows for type 11,29,13,15,17,19,23,31,32,33: 



SQL field 
name 


SQL Field Type 


Description 


Id 


Lit 4, not NULL, 
Primary Key 


Id of an existing Id in the _A2i_x_ where x is the 
same value as in this table's name. 


Subld 


Int4,notNULL 


Subld. Must specify an existing Id in the lookup 
table A2i_n_ or A2i_Data_n_ . n is a simple number 
taken from A2i_CM J?ields_.FieldParams where 
A2i_CM_Fields_.FieldId = /and is dependent on 
A2i JZMJFields_.FieldType being on of several Multi- 
Value Field Types [ via the IsMultiValuedFieldQ test] 


Create a non-Unique, clustered index on Id. 
Create a Unique Index on Id, Subld 
Create a non-Unique Index on Subld 


The structure of this table follows for type 37: 


SQL field 
name 


SQL Field Type 


Description 


Id 


Int4, not NULL, 
Primary Key 


Id of an existing Id in the _A2i_x_ where x is the 
same value as in this table's name. 


Value 


real 4, not 
NULL 


Measurement Value 


Units 


mt 4, not NULL 


Measurement Units Id 


Position 


Int4,notNULL 


Position in list of value for this Id 



Create a non-Unique, clustered index on Id. 
Create a Unique Index on Id, Position 



The reason the multi value lookup fields are stored in a separate table was to normalize 
the database to allow for SQL queries to search on multi value criteria and to return 
results stored in multi value fields. 
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Attribute Tables 



Attributes 

What is an Attribute ? 

An attribute is a parameter used to classify and describe a record, (i.e. 'screen size' of a 
monitor). It is similar to a category but only applies to subset of the entire record set. If 
it applied to all records it would simply be a category. This means that one group of 
records will have one set of attributes describing them, while another group of records 
will have completely different attributes describing them. An example is 'screen size' of 
a monitor and 'processor speed' of a computer. Both monitors and computers are 
records in the same table but they have different attributes describing them. 

How Attributes Relate To Records: 

Attributes apply to groups of records. A group of records is specified by creating an 
Hier AttrSubTableField in the main primary table and setting the value of this field to 
the Id of a record in a table of type Hier AttrTable. For Example, an 
Hier AttrSubTableField called 'SampleCategoryField' can be created in the main 
primary table, and another primary table of type Hier AttrTable called 
'SampleCategoryTable' can be created. One record in the 'SampleCategoryTable' may 
be a record describing the 'Monitor' category. Now all records in the main primary 
table with 'SampleCategoryField' linked to the record describing the 'Monitor' category 
in 'SampleCategoryTable' are in the Monitor group. 

Attributes are assigned to a group by linking them to a set of records in a table of type 
HierAttrTable. Continuing the example, an attribute called 'Screen Size' can be linked 
to the record in 'SampleCategoryTable' that describes the 'Monitor' category. Now all 
records in the main table with 'SampleCategoryField' that link to the record describing 
the 'Monitor' category in "SampleCategoryTable' will have the 'Screen Size' attribute. 

NOTE: For each table of type HierAttrTable, only 1 field of type Hier AttrSubTableField 
in the entire database can link to it. 

An attribute is either Text or Numeric. Previously these were referred to as Feature (for 
Text) and Characteristic (For Numeric). The naming of fields and tables still refers to 
Features and Characteristic: 

A Text Attribute is an enumerated list of Text Values. An example is "Valve Type". 
There is a small finite set of valve types. 

A Numeric Attribute is continuous. An example is length. Although you could 
enumerate all lengths in a list of products you gain certain advantages by treating it as 
Numeric. One is searching by range (not yet implemented). Another is the ability to 
convert between units (feet to meters). 



83 



WO 02/25500 



PCT/US01/29486 



Attribute Definition Tables: 

These tables contain the definitions of all attributes in the database. 

They are named 
_A2i_A_x_ 

where x is the Tableld of the Hier AttrTable that contains all the categories that 
these attributes are allowed to link to. 



The structure of this table follows: 



SQL field name 


SQL Field 
Type 


Description 


Attrld 


Int4, 

not NULL 


Id defining the Attribute. First valid Id is 0. 
Cannot repeat within this table. 


Permanentld 

• 


Int 4, not 
NULL, 
Identity (1, 
1) 


Ever increasing Id used to make sure newly 
added records are not confused with previous 
records that had the same id. Oracle does not 
need the Identity(l,l) descriptor. 


AttrName 


Varchar 128, 
not NULL 


Human readable name. This will be displayed 
when searching or viewing records 


AttrType 


Int 4, 

not NULL 


Determines if this is a Feature (Text) or a 
Characteristic (Number) and what values the 
attribute contains. Use bitwise OR on the 
following values to generate the AttrType. If 
any flags are set, this attribute is a 
Characteristic, otherwise it is a Feature 

1- Minimum 

2 - Maximum 
4 - Typical 

8 - Nominal (most common) 
16 - Average 


AttrDefn 


Varchar 255, 
not NULL 


Long description of the attribute 


AttrAlias 


Varchar 128, 
not NULL 


Not used yet, leave blank 
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AttrParam 


Int4, 

not NULL 


For Characteristics this determines the 
measurement type. 

1- length 

2 - weight 
For Features this determines whether the 
attribute is single or multi select 

0 - single select 

not 0- multi select 


DecimalPlaces 


Tinylnt 1, 
not NULL 


For Characteristics this determines the number 
of places after the decimal to display . The 
default is 3, meaning that the value 0.0255 will 
be displayed as 0.026. The specified number of 
places are always used so that the number 4 
will be displayed as 4.000 


AllowFractions 


Bitl, 

not NULL, 
default 0 


?? 


MeasurementType 


bit 4, 

not NULL 


For Characteristics this determines the 
measurement type of the value. Possible values 
are: 

0 = None 

1 = Length 






UnitsOfMeasure 


Int4, 

not NULL 


For Characteristics this determines the default 
units of measure of the value. This is the value 
that is automatically filled in when you set the 
attribute value from the catalog client. Also 
when you change the MeasurementType of an 
attribute, the first units of measure that you 
select will automatically overwrite the current 
units for all data values of this attribute. Its 
interpretation depends on the value of the 
MeasurementType field: 


Imageld 


Int4, 

not NULL 


Image Id of the image for this attribute. The 
Image Table Id is contained in the tables' table 
Params field for the associated category table. 


IsMultiValued 


Bit, not 
NULL 


Indicates whether this Attribute record ... 


CoupledAttrName 


Varchar 128, 
not NULL 




CoupledDecimalPlace 
* 

s 


Tinylnt 1, 
not NULL, 




CoupledAllowFractio 
ns 


Bit, not 
NULL 
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CoupledMeasurementT 
ype 


Int 4, not 
NULL 




CoupledDefaultUnitsOf 
Measure 


Int 4, not 
NULL 




CoupledSymmetricSear 
ch 


Bit, not 
NULL 





Create a Unique Valued, Clustered Index on Attrld 
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Category Attribute Linkage Tables: 

These tables determine which attributes apply to which categories (categories are 
records in a table of type HierAttrTable). By creating a record in this table you link an 
attribute to a category. All records in a separate table linked to that category will be 
described by the Attribute. 

The names of these tables are 
_A2i„CA_x_ 

where x is the Tableld of the HierAttrTable that contains all the categories that 
the Attributes are allowed to link to. 



The structure of this table follows: 



SQL field 
name 


SQL Field Type 


Description 


Id 


Int4,notNULL, 
part of Primary 
Key 


Id defining the Category. Must specify an 
existing Id in the _A2i_x_ FlatAttrTable or 
HierAttrTable, where x is the same value as in 
this tables name 


Attrld 


mt 4, not NULL, 
part of Primary 
Key 


Attribute Id. Must specify an existing Attrld in 
the A2i_A_x_ attribute definition table, where x 
matches. 


Priority 


Int4,notNULL 


Priority of this attribute link. Lower numbers 
cause this attribute to appear higher in the list of 
all attributes linked to this category or any of its 
descendants. Valid values are 1 to 100, default 
50. 



Create a Non-Unique, Clustered Index on Id. 

Create a Primary, Unique Valued Index on Id + Attrld. 

Create a Non-Unique Index on Attrld. 



What is Attribute Priority? This number ranks the attributes linked to a particular 
category according to importance of display. 

When a single category is selected in a Search Pick List, the attributes linked to that 
category and all of its ancestors are shown. Attributes with lower priorities are shown 
first. Attributes with the same priority are sorted by Attribute Name. 

When a result set of records all having the same category is displayed. The attributes 
are displayed as above. 

We don't yet know what to do if the records have different categories because that 
could cause the same attribute to be linked with two different priorities. 
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Feature Values Tables 

These tables determine the possible Text Values for all Text Attributes relating to a 
specific category table. 

The names of these tables are: 
_A2i_FV_z_ 

where x is the Tableld of the Hier AttrTable that contains all the categories that 
these attributes are allowed to link to. 



The structure of this table follows: 



SQL field 
name 


SQL Field Type 


Description 


Attrld 


Int 4, not NULL, 
part of Primary 
Key 


Attribute Id. Must specify an existing Attrld in 
the A2i_A_x_ attribute definition table, where x 
matches. 


jrcaiuJ.t.JLU 


Tnt 4 not MTJT L 
part of Primary 
Key 


Td dpfininfy thp pnumpratpd valup Thp Ids start at 
0, and should only be unique for all records with 
the same Attrld. Records with different Attrlds 
should start again at 0. 


Permanentld 


Int 4, not NULL, 
Identity (1, 1) 


Ever increasing Id used to make sure newly 
added records are not confused with previous 
records that had the same id. Oracle does not 
need the Identity(l,l) descriptor. 


FeatureName 


char 128, not 
NULL 


Human readable description of this attribute. 
Examples are 'white', 'air valve', 'Pentium II' 


Imageld 


Int 4, not NULL, 
default 0 


Image Id of this feature's image 


Position 


Int 4, not NULL, 
default 0 


position of this text value in the display of all text 
values. Starts at 0 and cannot have gaps, unless 
all values are 0. If all are 0, the server will set the 
values to the natural order when you rebuild 
indices. This allows you to easily convert old 
database. 



Create a Non-Unique, Clustered Index on Attrld. 

Create a Primary Key, Unique Valued Index on Attrld + Featureld. 

Create Unique Valued Index on Attrld + Position 



Note : Featureld should only be unique for records with the same Attrld. Each time the 
Attrld changes, start Featureld at 0 again. This allows us to use smaller structures to 
store the Feature Id's in memory resulting in less memory usage and faster searches. 
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Feature Entries Tables 

This is where all the Feature data is. These tables store the actual Text values selected 
for a particular Feature Attribute of a particular record. 

The names are: 
_A2i_F_x_ 

where x is the Tableld of the Hier AttrTable that contains all the categories that 
the attributes are allowed to link to. 



The structure of this table follows: 



SQL field 


SQL Field 


Description 


name 


Type 




Id 


Int 4, not 
NULL 


Main Product Id. Must specify an existing Id in the 
_A2i_x_ primary table, where x matches this table. 


Attrld 


Int 4, not 
NULL 


Attribute Id. Must specify an existing Attrld in the 
_A2i_A_x_ attribute definition table, where x matches. 


Featureld 


Int 4, not 
NULL 


Defines the enumerated value. Must specify an 
existing Featureld in the _A2iJFV_x_ Feature 
Enumerated Value table. 


Position 


Int 4, not 
NULL 


The ordering position for multiple features of a 
record. Beginning with 0, each position is unique per 
Id. 



Create a non-Unique Valued, Clustered Index on Id, very important for performance 
Create a Unique Valued Index on Id + Attrld + Featureld 
Create a non-Unique Valued, Index on Id + Attrld 
Create a non-Unique Valued, Index on Attrld + Featureld 



A record in this table indicates that for the record matching Id, its Attribute matching 
Attrld has the Text Value matching Featureld. 
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Characteristic Entries Tables 

This is where all the Numeric Attribute data is. These tables store the actual Numeric 
values selected for a particular attribute of a particular record. 

The names are: 
_A2i_C_x_ 

where x is the Tableld of the HierAttrTable that contains all the categories that 
the Characteristic Attributes are allowed to link to. 



The structure of this table follows: 



SQL field 
name 


SQL Field Type 


Description 


Id 


frit 4, not NULL, 
part of Primary Key. 


Main Product Id. Must specify an existing Id in 
the _A2i_x_ primary table, where x matches this 
table. 


Attrld 


frit 4 not NULL, part 
of Primary Key 


Attribute Id. Must specify an existing Attrld in 
the _A2i_A_x_ attribute definition table, where 
x matches. 


CharType 


Tinylnt l,not 
NULL, 

part of Primary Key 


Characteristic type. Must be exactly one of the 
possible flags set in the AttrType field of 
the_A2i_A_x_ attribute definition table for the 
attribute with Attrld equal to the previous 
field's value. There should be one record in this 
table for each flag set in the attribute definition 
table for the Attribute defined by Attrld for 
every main product Id 


Value 


Real 4, not NULL 


Actual value of this attribute. For example 3 
1/4 inches would be 3.25 


Position 


frit 4, not NULL 


The ordering position for multiple attributes of 
a record. Beginning with 0, each position is 
unique per Id. 


Units 


frit 4, not NULL 


Type of units for the Value field above. This is 
an enumeration whose valid values and 
descriptions depend on the AttrParam field in 
the attribute definition table for the attribute 
with Attrld. Currently these are: 
If AttrParam = 
0 (none), Units can be 

0 - none 
l(length), Units can be 

1 = mm 

2 = inches 
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Create a non-Unique Valued, Clustered Index on Id, very important for performance 
Create a Unique Valued Index on Id + Attrld + CharType 
Create a non-Unique Valued, Index on Id + Attrld 

Coupled Numeric Entries Tables 

This is where all the Coupled Numeric Attribute data is. These tables store pairs of 
actual Numeric values selected for a particular attribute of a particular record. 

The names are: 
_A2i_CNx_ 

where x is the Tableld of the HierAttrTable that contains all the categories that 
the Characteristic Attributes are allowed to link to. 



The structure of this table follows: 



SQL field 
name 


SQL Field 
Type 


Description 


Id 


Int 4, not 
NULL 


Main Product Id. Must specify an existing Id in the 
_A2i_x_ primary table, where x matches this table. 


Attrld 


Int 4 not NULL 


Attribute Id. Must specify an existing Attrld in the 
_A2i_A_x_ attribute definition table, where x 
matches. 


Value 


Real 4, not 


Actual value for the left side this attribute. For 




NULL 


example 31/4 inches would be 3.25 


Units 


Int 4, not 
NULL 


Type of units for the Value field above. This is an 
enumeration whose valid values and descriptions 
depend on the MeasurementType field in the 
attribute definition table for the attribute with Attrld. 
See the current units schedule for a list of unit types. 


CoupledValue 


Real 4, not 
NULL 


Actual value for the right side of this attribute. For 
example 31/4 inches would be 3.25 


CoupledUnits 


Int 4, not 
NULL 


Type of units for the Value field above. This is an 
enumeration whose valid values and descriptions 
depend on the MeasurementType field in the 
attribute definition table for the attribute with Attrld. 
See the current units schedule for a list of unit types. 


Position 


Int 4, not 
NULL 


The ordering position for multiple attributes of a 
record. Beginning with 0, each position is unique per 
Id. 



Create a non-Unique Valued, Clustered Index on Id, very important for performance 
Create a Unique Valued Index on Id + Attrld + Position 
Create a non-Unique Valued, Index on Attrld 

Create a Unique Valued, Index on Id + Attrld + Value + Units + CoupledValue + 
CoupledUnits 
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Following is an example of some couples 
350 hp @ 2500 rpm 
375 hp @ 3000 rpm 



Matching Sets Table 

Quick description of matching sets 

Matching sets are a way of associating products in one category with products in 
another category. For example Nuts and Bolts are two categories. The products in the 
Nuts category match the products in the Bolts category if their Width and Thread Pitch 
match. A matching consists of the two categories and a list of the common attributes 
that must match for a product to be considered a match. 

The matching set tables store the matching set information. The names are 

_A2i_MS_x_ where x is the Tableld of the HierAttrTable that contains the categories 
that have the groupings. 



No primary key is needed 



SQL field 
name 


SQL Field 
Type 


Description 


Idl 


Int 4, not 
NULL 


Category Id. Must specify an existing Id in the 
_A2i_x_ primary table, where x matches this table. 


Id2 


Int 4, not 
NULL 


Category Id. Must specify an existing Id in the 
_A2i_x_ primary table, where x matches this table. 


CatlAttrld 


Int 4 not NULL 


Attribute Id. Must specify an existing Attrld in the 
_A2i_A_x_ attribute definition table, where x matches. 


CatlRating 


Int 4 not NULL 


For Text Attributes, this always equals -1. 

For Numeric Attributes, this is the rating to match on. 

If the value is set to -1 for numeric attributes, the first 

available rating will be chosen and written to sql 

when you start the database with the rebuild indices 

option. This allows easy updating of previous 

databases 


Cat2AttrId 


Int 4 not NULL 


same as CatlAttrld 


Cat2Rating 


Int 4 not NULL 


same as CatlRating 



Ix_MS_x_Idl, non-unique index on Idl 



92 



WO 02/25500 



PCT/US01/29486 



IxJVlS_xJd2, non-unique index on Id2 

Family Tables 



Families 

Quick description of families. 

Families are a way of grouping records by structured queries, then assigning common 
information to the groups and organizing each group's display of its records. Each 
group of records is called a family. 

Families are created by Partitioning the records based on a category, then sub- 
partitioning these groups based on other categories or attributes. With the exception of 
the first partition, families only exist where the combination of values in the partitioned 
fields /attribute results in a non-zero set of records. 

The first partition is special in a few ways: 

1) Its partition field is specified in the _A2i_Server_ table, FamilyCatFieldld 

2) It can only be a field, not an attribute, because attribute do not exist at a global level 

3) If you wish to partition on attributes, the category field that uses attributes must be 
this first partition 

4) Families in the first partition ALWAYS exists even if no records belong to them. 
This is a convenience to allow some initial family setup before all the data is entered. 

Within a group, the records can be Pivoted by Depth, Vertically or Horizontally. This 
extracts the values of the pivot field and makes a separate section for records with that 
value. 
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Family Structure Table 

This table holds all the partition, pivot, sorting, ordering and hidden information. 
Structure is tied to a family node. All children then inherit it, unless the child overrides 
the inheritance. Children can override each type of structure element individually. 

Partition - This determines the hierarchy of the family tree. Only main table lookup 
fields, and Text Attributes are allowed in the partition. Numeric attributes are not 
allowed. Every time you add a field /attribute to the partition, you create additional 
child family nodes below the current child nodes. The records will be split up 
according to the values they have for the new partition field/attribute. 

Pivot (Depth, Vertical, Horizontal) - This also splits up records into groups, but is used 
for display only. It does not create new family nodes 

Sorting - This specifies which fields /attributes to sort on in the final display. More than 
one field/attribute can be used. The display will sort first on the first field/attribute, 
then on the second, etc. 

Ordering - This is the display order of the fields/ attributes in the final display. 

Hidden - This is a list of fields/ attributes that should not be displayed. 

Partition and Pivot allow you to concatenate multiple fields at the same level. This has 
a slightly different effect than placing the fields on different levels. For example, a 
family has 2 attributes available for partitioning, Color(red, blue) and 
Horsepower(gutless, gas-guzzler). Creating 2 partition levels, the first with Color and 
the second with Horsepower would look like. 

red family 

gutless family 

gas-guzzler family 
blue family 

gutless family 

gas-guzzler family 

If you added a single partition level with Color and Horsepower, Color would be in 
NestedPosition = 0, Horsepower in NestedPosition = 1, and you'd get 

red - guttless family 
blue - gas-guzzler family 
red - guttless family 
blue - gas-guzzler family 
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The name of the table is 
_A2iJFamilyStructure_ 



The structure of this table follows: 



SQL field 
name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NULL 


Family Item Id, root has Itemld = 0, others continue 
from Ion up. 


StructureType 


Int 4, not 
NULL 


Structure type specified: 

1 = FamilyPartition 

2 = FamilyDepthPivot 

3 = FamilyHorizontalPivot 

4 = Family VerticalPivot 

5 = FamilySorting 

6 = FamilyOrdering 

7 = FamilyHidden 


NestedPosition 


Int 4, not 
NULL 


Position for Structure Type in this Family. Starts 
with 0, the next additional position is 1, etc. 


Concatenation 
Position 


Int 4, not 
NULL 


Position within a NestedPosition that this item exists 

in. The first position is 0, then 1 and so on. 

For StructureTypes 5,6,7 this is always 0. 

For StructureTypes 1,2,3,4/ you may have more than 

1 field specified for a partition or pivot, in that case 

the second field has position 1, and so on. 


FieldOrAttrld 


Int 4, not 

XTT TT T 


Main Table Field Id or Attribute field Id 


IsAttributeFiel 
d 


Bit, not NULL 


Whether this is an attribute or a main table field 


Rating 


Int 4, not 
NULL 


If not an attribute field set it to 0. 

For Features set it to -l(InvalidRating) 

For Characteristic set it to the one of the following 

values 

1 = Minimum 

2 = Maximum 
4 = Typical 

8 = Nominal 
16 = Average 


SortType 


Int 4, not 
NULL 


Only used for Structure Type 5(FamilySorting). 
1 = ascending 
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1 1 0 = descending 

Create a Unique Index on Familyltemld, StructureType, NestedPosition, 

ConcatenationPosition 

Family Structure Recycled Table 

This table holds information about familiy nodes that have been deleted, but had family 
structure information defined. 

The name of this table is 
_A2i_FamUyStructureRecycled_ 



The structure of this table follows: 



SQL field 
name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NT IT T 


Recycled Family Item Id, start from 1 on up. No root 


StructureType 


Int 4, not 
NULL 


Structure type specified: 

1 = FamilyPartition 

2 a FamilyDepthPivot 

3 = FamilyHorizontalPivot 

4 = Family VerticalPivot 

5 = FamilySorting 

6 = FamilyOrdering 

7 = FamilyHidden 


NestedPosition 


Int 4, not 
NULL 


Position for Structure Type in this Family. Starts 
with 0, the next additional position is 1, etc. 


Concatenation 
Position 


Int 4, not 
NULL 


Position within a NestedPosition that this item exists 

in. The first position is 0, then 1 and so on. 

For StructureTypes 5,6,7 this is always 0. 

For StructureTypes 1,2,3,4, you may have more than 

1 field specified for a partition or pivot, in that case 

the second field has position 1, and so on. 


FieldOrAttrid 


Int 4, not 
NULL 


Main Table Field Id or Attribute field Id 


IsAttributeFiel 
d 


Bit, not NULL 


Whether this is an attribute or a main table field 


Rating 


Int 4, not 
NULL 


If not an attribute field set it to 0. 

For Features set it to -l(InvalidRating) 

For Characteristic set it to the one of the following 

values 

1 = Mirumum 
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2 = Maximum 






4 = Typical 






8 = Nominal 






16 = Average 


SortType 


Int 4, not 


Only used for Structure Type 5(FamilySorting). 


NULL 


1 = ascending 






0 = descending 



Create a Unique Index on Familyltemld, StructureType, NestedPosition, 



ConcatenationPosition 
Family Items Table 

This table holds basic information about the family. It is a global table that applies to 
the main table in the database. 



_A2iJFamilyItems_ 



SQL field name 


SQL Field 
Type 


Description 


Faihilyltemld 


Int 4, not 
NULL 


Family Item Id, unique, root has Id = 0, others 
continue from 1 on up. 


Parentld 


Int 4, not 
NULL 


Parent Id of this item 


RelativePosition 


Int 4, not 
NULL 


Relative position of this family item within its 
siblings. Because families only exist where their 
query results in a non-empty set of records, not 
all combinations of the partitioned fields result 
in families. The relative position is based on the 
actual position of the partitioned fields' 
attributes 


InheritPartition 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritDepthPivot 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritVerticalPivot 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritHorizontalPi 
vot 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritSorting 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritOrdering 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritHidden 


Bit, not NULL 


1 when family item inherits this value from its 
parent 
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Create a Clustered, Unique Index on Familyltemld 

This table requires an initial ROOT node with the following values 
Itemld = 0 
Parentld = -1 
RelativePosition = -1 
Inherit* = 1 (all inherits set to 0) 

Note : the user may assign structure information to this root node, so the inherit* values 
may change. 

Since the first partition always results in families, this table must be initialized with all 
the values in the category table chosen as the first partition. The Itemld, Parentld, and 
RelativePosition may be initially set to the same value in the category table. Although 
these values may diverge after time. 
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Family Items Recycled Table 

This table holds basic information about family nodes that have been deleted, but 
contained links to common information or structure. This allows users to recover their 
work when then make a change that destroys these families 



.A2i_FamilyItemsRecycled_ 



SQL field name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 

TvTTTT T 


Family Item Id 


Description 


Varcnar 255, 
not NULL 
default empty 
string 


• — • F~r — 7 — ~i j — n~> r 

Description of the family node. Since the 

position in the family hierarchy is lost by 

deleting a node, tnis description is a patn to 

where the family used to be. i.e. 

v^aLegory.Dearmgb-^iviir.o-NX"--> 1 ypc.ucui. 


InheritPartition 


Bit, not NULL 


1 when family item inherits this value from its 
narpnt 


InheritDepthPivot 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritVerticalPivot 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritHorizontalPi 


Bit, not NULL 


1 when family item inherits this value from its 


vot 




parent 


InheritSorting 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritOrdering 


Bit, not NULL 


1 when family item inherits this value from its 
parent 


InheritHidden 


Bit, not NULL 


1 when family item inherits this value from its 
parent 



Create a Clustered, Unique Index on Familyltemld 
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Family Item Values Table 

This table holds the information describing the partial query for each family node. 
Every node represents 1 or more criteria. Tracing the node back to the root gives you 
the entire query. 

Nodes are allowed to have more than 1 field/value combination. This occurs when an 
ancestors partition specified more than 1 field for the partition's NestedPosition. This 
node then represents a concatenation of values. 



This Family has the name 
_A2i_FamilyItemValues_ 



SQL field name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NULL 


Family Item Id 


FieldOrAttrld 


Int 4, not 
NULL 


Field or Attribute Id this value corresponds to 


FieldOrAttrValue 


Int 4, not 
NULL 


Value of field. Since only lookup fields, and Text 
Attributes are allowed in the partition, this value 
is always a Uint32 


IsAttributeField 


Bit, not NULL 


Whether this is a lookup field value, or attribute 
text value 


ConcatenationPositi 


Int 4, not 


Where in the concatenation of values this value 


on 


NULL 


exists. This starts at 0, and continues if more 
than 1 field are concatenated at this partitions 
NestedPosition. ! 



Create a Clustered, non-Unique Index on Familyltemld 

Create a Unique Index on Familyltemld + ConcatenationPosition 



The initial table needs a definition for the ROOT node. 
Itemld = 0 

Fieldld = {main table category field Id used as base category field for family tree) 
FieldValue = 0 
IsAttributeField = 0 
ConcatenationPosition = 0 
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Family Fields Table 

This table specifies which fields all families have. Just like primary tables, families can 
have fields. The field values apply to all records in the family. 



The name of this table is 
_A2i_FamilyFields_ 



SQL field name 


SQL Field Type 


Description 


Permanentld 


Int^ not NULL, 
Identity (1, 1) 


Ever increasing Id used to make sure newly 
added records are not confused with previous 
records that had the same id. Oracle does not 
need the Identity(l,l) descriptor. 


FamilyFieldld 


11*4, not NULL 


Fieldld, starting at 1 


FamilyFieldNam 
e 


Varchar 50, not 
NULL 


name of field 


FamilyFieldType 


mt4,notNULL 


Type of field. For now, all fields must be 
object data fields, valid types are 

12- TextField 

13- MultiTextField 

14 - LnageField. 

15 - MultilmageField 
16-SoundField(NYI) 

17 - MultiSoundField, NYI 

18-VideoField,NYI 

19 - MultiVideoField, NYI 

20 - NOT USED 

21 - NOT USED 

22- ExtFiekLNYI 

23- MultiExtField,NYI 

31 - MultiTemplateField, NYI 

32 - PDFDocumentField, NYI 

33 - MultiPDFDocumentField, NYI 


LookupTableld 


Int4,notNULL 


The table Id of the object table that this field's 
values correspond to. 



Create a Clustered, Non-unique Index on FamilyFieldld 
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Family Record Values Table 

This holds the values set for the Family Fields for all family items. This table is like the 
Attribute _c_ and _f_ table in that if a family item does not have a value set, nothing is 
stored. 



The name of this table is 
_A2i_FamilyRecVals_ 



SQL field 
name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NULL 


Family Item Id 


FamilyFieldld 


Int 4, not 
NULL 


Family Field Id 


Value 


Int 4, not 
NULL 


Value. This corresponds to a record in the object 
table linked to this field. This cannot be 0. If more 
than one value are set for a field (multi-valued fields) 
there will be more than one entry in this table for that 
field 



Create a Clustered, Non-unique Index on Familyltemld 
Create a Unique Index on Familyltemld, FamilyFieldld, Value 



Family Record Values Recycled Table 

This holds information for deleted family items that had fields set 



The name of this table is 
_A2iJFamilyRecValsRecycled_ 



SQL field 
name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NULL 


Family Item Id 


FamilyFieldld 


Int 4, not 
NULL 


Family Field Id 


Value 


Int 4/ not 
NULL 


Value. This corresponds to a record in the object 
table linked to this field. This cannot be 0. If more 
than one value are set for a field (multi-valued fields) 
there will be more than one entry in this table for that 
field 



Create a Clustered, Non-unique Index on Familyltemld 
Create a Unique Index on Familyltemld, FamilyFieldld, Value 



102 



WO 02/25500 



PCT/US01/29486 



Family Column Names Table 

This holds information about family column names. 



The name of this table is 
_A2i_FamilyColumnNames_ 



SQL field 
name 


SQL Field 
Type 


Description 


Familyltemld 


Int 4, not 
NULL 


Family Item Id 


FamilyOrAttrl 
d 


Int 4, not 
NULL 


Field or Attribute Id this value corresponds to 


IsAttributeFiel 
d 


Bit, not NULL 


Whether this is a lookup field value, or attribute text 
value linked to this field. This cannot be 0. If more 
than one value are set for a field (multi-valued fields) 
there will be more than one entry in this table for that 
field 


Rating 


Tinyint, not 
NULL 


????????? 


DisplayName 


Varchar 255, 
not NULL 


Displayed name for the column 



Create a Unique Index on Familyltemld, FamilyOr Attrld, Rating 



103 



WO 02/25500 



PCT/US01/29486 



Large Object (Lob) Data Tables (images, video, etc) 

The organization and structure of large object data (sometime referred to as external or 
indirect data) is stored in the SQL database. The xCat Server does not cache it. 

_A2i_CM_Data_Tables_ 



This describes the structure of the data tables. These data tables have names 
_A2i_Data_x_ where x is an Id starting at 1. 



SQL field 
name 


SQL Field Type 


Description 


DataTableld 


bit 4, not NULL, 
Primary Key. 


Id starting at 1 of the Data Table. The table 
names will be _A2iJData_x_ where x is this 
Id. 


Permanentld 


bit 4, not NULL, 
Identity (1, 1) 


Ever increasing Id used to make sure newly 
added records are not confused with 
previous records that had the same id. 
Oracle does not need the Identity(l,l) 
descriptor. 


DataTableType 


bit 4 not NULL 


Type of table, valid values are: 

5, 6, 7, 8, 18 
Refer to the Table Type Schedule for a list of 
TableType values and a description of each. 


DataTableNam 
e 


Varchar 255, not 
NULL 


Name of table 


Create a Primary, Unique index on DataTableld 


_A2i_CM_Data_Groups_ 




This is a table of user defined groups that the external data items can be assigned to. It 
is a way to categories the data items for easy searching at a later time. Each record is a 
group. 


SQL field 
name 


SQL Field Type 


Description 


Id 


bit 4, not NULL, 
Primary Key. 


Id of this group starting at 1 


Parentld 


bit 4 not NULL 


Id of this group's parent. This must be -1 
for top level groups or an existing Id in this 
table for child groups 
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GroupName 


Varchar 255, not 


Name of group 




NULL 





Create a Primary, Unique index on Id 
NOTE: Do not insert a null record 



A2i_CM_Data_Locations 



This hierarchical table describes exactly where the data items are. Data items are 
assigned ids from this table to specify exactly where they are. 



SQL 
field 
name 


SQL Field 
Type 


Description 


Id 


Int 4, not 
NULL, 
Primary Key. 


Id of this location starting at 1 


Parentld 


Int 4, not 
NULL 


Id of this location's parent. This must be -1 for top level 
locations or an existing Id in this table for child locations. 


Name 


Varchar 50, 
Not NULL 


Name of location. Each name is part of a universal path, so 
the name must conform to file and directory naming 
restrictions. No backslashes \ are allowed in the name. 


Type 


Int 4, not 
NULL 


Type of location. Valid types and their meanings are: 






1 - PhysicalLocation, This is a physical location such as 
AH, Century City Office, or Server Room. 
PhysicalLocations start at the top of the hierarchy. 






2 - Computerisation, This is the network name of the 
computer where item data can reside. These locations 
appear direcly under Physical Locations and before any 
volume information. 






3 - SharedFixedDeviceLocation, This is a shared network 
volume such as big_vol, data or catalogs. It comes after 
ComputerLocation and before RelativePathLocation in 
the hierarchy. 






4 - LocalFixedDeviceLocation, This is a local permanent 
disk drive. Usually named c$ or d$ to indicate the c: or 
d: drive. It comes after ComputerLocation and before 
RelativePathLocation in the hierarchy. 






5 - RemovableDeviceLocation, This is a local removeable 
drive such as a cd-rom drive. It is usually named e$ to 
indicate the e: drive. This comes after 
ComputerLocation and before 
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RemovableMediaLocation in the hierarchy. 

6 - RemovableMediaLocation, This is the volume name of 

the removable disk. It comes after the 
RemovableDeviceLocation and before 
RelativePathLocation. 

7 - RelativePathLocation, This is a part of a relative path on 
; a drive. It represents 1 directory. Subdirectories will be 

be children of their parent RelativePathLocations. 

Create a Primary, Unique index on Id 

NOTE: the description field has been removed 



Each record represents 1 part in a part of locations. And example is 
A2iUSA\Dave_Office\sullivan\d$\work\images\testImages 



The records that make this up would be: 
(Id, Parentld, type, name) 

1, -1, PhysicalLocation, 

2, 1, PhysicalLocation, 

3, 2, Computer/Location, 

4, 3, LocalFixedDeviceLocation, 

5, 4, RelativePathLocation, 

6, 5, RelativePathLocation, 

7, 6, RelativePathLocation, 



A2iUSA 
Dave_Office 
sullivan 
d$ 

work 



images 

testlmages 



A2i G x 



This table is no longer used and can be removed from any existing databases 
_A2i_CM_Data_Views_ 

This table is no longer used and can be removed from any existing databases 
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.A2i_CM_Publications_ 



SQL field 


SQL Field 


Description 


name 


Type 




Id 


Int 4, not 


Id of this publication starting at 1 




NULL, 






Primary Key. 




Name 


Varchar 255 


name of the publication 



Create a Primary, Unique index on Id 



A2iJ?ublications_XL 



This table describes a publication, represented as a tree. V in the table name 
corresponds to an entry in the _A2iJZMJPublications_ table. 



SQL field 
name 


SQL Field Type 


Description 


Id 


Int 4, not NULL 


Id of this record, starting at 1 


Parent 


Int 4, not NULL 


Record Id of this record's parent, root id's parent is - 
1 


Type 


Int 4, not NULL 


Publication type: 

1- ??? (To Be Determined) 

2- ??? (etc) 


Position 


Int4,notNULL 


Position relative to other siblings of the same parent, 
starting at 1 


Parent 


Int 4, not NULL 


Record Id of this record's parent, root id's parent is - 
1 


Name 


Varchar 255 


Displayed name of the publication 


Data 


Image, not 
NULL 


Binary Object. Structure is ... 


Create a Primary, Unique index on Id 




_A2LCM_Media_ 




This table contains user-defined descriptions of the media type of the item data. 


SQL field 
name 


SQL Field 
Type 


Description 


Id 


Int 4, not 
NULL, 
Primary Key. 


Id of this media type starting at 1 


Media 


Varchar 50 


name of the media 


Parentld 


Int 4, not 
NULL 1 


Id of parent media type. Top level media types have 
a Parentld of -1 



Create a Primary, Unique index on Id 
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_A2LData_x_ 

These tables contain the definitions of the data items. Each record represents a single 
data item. All data tables have the first 5 fields in common. 



SQL field 
name 


SQL Field 
Type 


Description 


Datald 


Int 4, not 
NULL, 
Primary Key. 


Id of this data item starting at 1, there is no record 0. 


OrigMediald 


bit 4, not 

"N.TTTTT J _ C~ - - 1 1. 

NULL, default 
0 


Id of the original media type, must be zero 
(indicating that no media type is assigned) or a valid 
Id in the _A2i_CMJvledia_ table 


OrigLocationld 


Int 4, not 
NULL, default 
0 


Id of the original location, must be a valid Id in the 
_A2i_CMJLocations_ table 


DataGroupId 


Int 4, not 
NULL, default 
0 


Id of the group this item belongs to. Must be a valid Id in 
the _A2i_CM_Data_Groups_ table 


DataSize 


Int 4, not 
NULL, default 
0 


Size in bytes of the stored data object. For the TextTable 
type, the size is the sum of both TextStart and TextRest 



Create Primary, Unique index on Datald 



Each type of data table (text, image, pdf, video, sound) has additional fields. Currently 
only the image, text and pdf tables are fully defined. 



Text Table (Type 5) additional fields 



SQL field 
name 


SQL Field Type 


Description 




TextStart 


Varchar 255, not 
NULL 


first 255 characters of text 




TextRest 


Text 


remaining text 




no other supporting tables 






PDF Table (Type 18) additional fields 






SQL field 
name 


SQL Field Type 


Description 




OrigName 


Varchar 255, not NULL 


Original name of this item 


HasOriginal 


Bit, not NULL, default 
0 


Specifies whether or not the original pdf is ! 
stored in the sql database. If so, there will be a 
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record with the same Datald in the 
corresponding _A2i_Originals_x_ table which 
resides in the [DatabaseName] Originals 
database 



PDF tables have supporting tables in the { database } ..Originals or { database )0 database. 
The supporting table is _A2i_Origiiials_x_ where x matches the Id of the main database table 



SQL field 
name 


SQL Field Type 


Description 


Datald 


Int4,notNULL 


Id matching Id in main database 


PDF 


Image, not NULL 


Actual pdf document 


Image Table (Type 6) additional fields 


SQL field 
name 


SQL Field Type 


Description 


OrigName 


Varchar 255, not 
NULL 


original name of this item 


ProcessedNam 
e 


Varchar 255, not 
NULL 


optional new name after processing 


Width 


Int4,notNULL. 


Width in pixels of image 


Height 


Int4,notNULL 


Height in pixels of image 


HasOriginal 


Bit, not NULL 


Specifies whether or not the original images is 
stored in the sql database. If so, there will be a 
record with the same Datald in the 
corresponding _A2i_Originals_x_ table which 
resides in the [ DatabaseNarneJOn^nals 
database 


Format 


Int4,notNULL 


Format of the image. 

1- BMP 

2- GIF 

3 - JPEG 

4- TIFF 

5 - PCD 

6- EPS 

7 - PNG 

8- PSD 


Zipped 


Bit, not NULL 


Specifies if the original image stored in the 
database is zipped. 



Image tables have support tables in the {databaseLOriginals and 
{database} JThumbnails databases. 



109 



WO 02/25500 



PCT/US01/29486 



In the {database}_Originals or {databaseJO, the supporting table is _A2i_Originals_x_ 
where x matches the Id of the main database table 



SQL field 
name 


SQL Field Type 


Description 


Datald 


fcit4,notNULL 


Id matching Id in main database 


Original 


Image, not NULL 


Original (not altered) image 


In the {database} JThumbnails or {database}T, the supporting table is 
_A2i_Thumbnails_x_ where x matches the Id of the main database table 


SQL field 
name 


SQL Field Type 


Description 


Datald 


Int4,notNULL 


Id matching Id in main database 


Thumbnail 


Image, not NULL 


Thumbnail image generated from the 
original, currently bounded by (200 x 200) 
box. 



For each _A2i_Originals_x_ and _A2i_Thumbnails_x_, create a unique primary index on Datald. 
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Image Variant Tables 

Images in the Catalog Manager can be processed to various specifications and stored. 
An Image Variant is the term used to describe a processed image. 

_A2i_Img_Filters_ 



Filters table. (Currently not used) 



SQL field 


SQL Field Type 


Description 


name 






Filterld 


Int 4, not NULL 


Id of the filter. 


Filter 


Image, not NULL 





Create a Unique index on Filterld 



_A2iJmg_Scripts _ 



Scripts table, (Currently not used) 



SQL field 


SQL Field Type 


Description 


name 






Scriptld 


Int 4, not NULL 




ScriptName 


Varchar 50, not NULL 





Create a Unique index on Scriptld 



_A2i_Img_SF_ 



Script-Filter table. (Currently not used) 



SQL 


SQL Field 


Description 


field 


Type 




name 






Scriptld 


Int 4, not 
NULL 




Filterld 


Int 4, not 
NULL 





Create a Unique index on Scriptld, Filterld 



A2iJmg_Variants_ 



Variants Table. This is the directory for all Variant tables in the database. 



SQL field name 


SQL Field Type 


Description 


Variantld 


Int 4, not NULL 


Id of this Variant | 
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VariantName 


Varchar 12o, not 

XTT TT T 

JNULL 


JName or the 
variant 


TAT- JiL 

Width 


T— x. yl — —X. X TT TT T 

int 4, not JN ULL 


Bounding wiutn 


Height 


t„ a, a - .4. X TT TT T 

lnt 4, not JN ULL 


Dounaing Jtieignt 


OptimizeStorage 


T«-k4- /I -r>i-v4- XTT TT T 

lnt 4, not JN U LL 




IVScalingMoae 


T~»a, A j-**. XTT TT T 

lnt 4, not In ULL 




OutputResolution 


T««i. /I XTT TT T 

lnt 4, not JNULL 




IVColorMode 


Int 4, not NULL 




IVPaletteType 


Int 4, not NULL 




IvColorReductionMetho 
d 


Int 4, not NULL 




IccProfile 


Varchar 255, not 

XTT TT T 
JNULL 


rath of ICC rronle 


GammaCorrection i 


T— i. A _ — . i. X TT TT T 

lnt 4, not JNULL 




IVOutputFormat 


t„ 1 ^ __i XTT TT T 

lnt 4, not JNULL 




lVSubrormat 


t„ 1 A XTT TT T 

lnt 4, not JN U LL 




A J IT) J 

AddBorders 


t_ _ j. A „ _ «. XTT TT T 

Int 4, not NULL 




BorderRGB 


T _ « A _ » "K TT TT T 

Int 4, not NULL 




BorderTopPixels 


T t. A i_ A. TT TT T 

Int 4, not NULL 




BorderBottomPixels 


T - t A _ _ t, X TT TT T 

Int 4, not JNULL 




BorderLeftPixels 


t * a . . ^ t X TT TT T 

Int 4, not NULL 




BorderRightPixels 


T 1 /I _ _ l X TT TT T 

lnt 4, not NULL 




A 1 JTAT i 1 

AddWatermark 


T— 1. A ^ _ 1. X TT TT T 

lnt 4, not JNULL 




i v vv arerniarK i ype 


Tnt 4. not NTT TT T 




WatermarkSize 


Int 4, not NULL 




IVWatermarkPosition 


Int 4, not NULL 





Create Primary, Unique index on Variantld 



_A2iJmg_VIS_ 



Variant-Image-Script table. There is only one _A2iJmg_VIS_ table in one database. This 
table stores information about all Variant images. 



SQL field 


SQL Field Type 


Description 


name 




Variantld 


Int 4, not NULL 




Imageld 


Int 4, not NULL 




Scriptld 


Int 4, not NULL 




Status 


Int 4, not NULL 




DataSize 


Int 4, not NULL 




Width 


Int 4, not NULL 




Height 


Int 4, not NULL 




Format 


Int 4, not NULL 
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Create an index on Variantld 

Create a Unique index on Variantld, Imageld 
_A2iJmages_ 



The actual image variant data is stored in a Variant database (its name is stored in table 
A2i_xCat_DBs). There's only one table in this database named as "_A2IJmages_" 



SQL field 


SQL Field Type 


Description 


name 






DataTableld 


Int^notNULL 




Variantld 


11*4, not NULL 




Datald 


Int4,notNULL 




Variant 


Image, not NULL 




CrcOfOriginal 


Int 4, Not NULL 


CRC of Original Image when Variant was set ! 



Create a Unique index on DataTableld, Variantld, Datald 



GRAVEYARD 

Publishing catalogs of product information to paper and to electronic media are two 
very distinct processes, with very different standards and expectations for quality. 
Paper catalogs have been meticulously laid out a page at a time by populating page 
layout programs with product data, a process that is manual, time-consuming, tedious 
and very, very expensive, but has typically resulted in high page density, flexible and 
well-structured tabular layout formats, and a very high overall standard of quality. By 
contrast, electronic catalogs have been generated programmatically in real-time but 
have comparably primitive presentations and are of relatively low quality. The 
challenge is to eliminate the distinctions between paper and electronic output and 
combine the best of both media in a way that dramatically reduces the cost of laying out 
paper catalogs by flexibly and programmatically generating page layouts automatically 
and in real time, and at the same time brings the structure and high standard of quality 
to electronic catalog pages without custom programming. 



Thus a method and apparatus for dynamically formatting and displaying tabular 
data in real time has been described. The invention, however, is defined by the claims 
and the full scope of any equivalents. 
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CLAIMS 

What is claimed is: 

1) In a computer system, a method for formatting table data comprising: 

obtaining a group of records; 

obtaining layout information from a user wherein said layout information is 
stored independent of said group of records; 

dynamically applying said layout information to said first table to generate a 
preview table; 

displaying said preview table to said user. 

2) The method of claim 1 wherein said user modifies said layout information and 
said computer system iteratively executes said step of dynamically applying said layout 
information to said first table to generate said preview table. 

3) The method of claim 1 wherein said group of records comprises a plurality of 
fields. 

4) The method of claim 1 wherein said group of records comprises a plurality of 
attributes. 

5) The method of claim 1 wherein said group of records are related by at least one 
common value. 
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6) The method of claim 5 wherein said at least one common value represents at 
least one value from said group of records 

7) The method of claim 1 further comprising: 

displaying at least a subset of said group of records in a table. 

8) The method of claim 1 wherein said layout information defines hidden fields. 

9) The method of claim 1 wherein said layout information defines hidden 
attributes. 

10) The method of claim 1 wherein said layout information comprises at least one 
pivot value associated with a pivot operation. 

11) The method of claim 10 wherein said at least one pivot value is hidden from said 
preview table. 

12) The method of claim 10 wherein said at least one pivot value is from said group 
of records. 

14) The method of claim 10 wherein said pivot operation reduces redundant 
information displayed in said preview table. 

15) The method of claim 10 wherein said pivot operation comprises: 
identifying at least one pivot axis in a table wherein said pivot access 

corresponds to said at least one pivot value; 

removing said at least one pivot axis from said table; 
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generating said preview table by breaking said preview table into sub-tables 
based on said at least one pivot axis. 



16) The method of claim 15 wherein said pivot axis comprises a row in said table. 

17) The method of claim 15 wherein said pivot axis comprises a column in said table. 

18) The method of claim 15 further comprising: 

sorting said group of records from said table into said sub-tables based on said at 
least one pivot value of said pivot axis. 

19) The method of claim 15 wherein said pivot operation comprises a horizontal 
pivot comprising: 

combining said sub-tables into said preview table by arranging said sub-tables 
horizontally. 

20) The method of claim 19 further comprising: 

adding an additional row to said preview table comprising said at least one pivot 
value, said additional row labeling said sub-tables. 

21) The method of claim 15 wherein said pivot operation comprises a stack pivot 
comprising: 

combining said sub-tables into said preview table by arranging said sub-tables 
vertically. 
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22) The method of claim 21 further comprising adding an additional row to said 
preview table containing said at least one pivot value before each of said sub-tables. 

23) The method of claim 21 further comprising: 

preserving each of said sub-tables; 

labeling said sub-tables with said at least one pivot value. 

25) The method of claim 15 wherein said pivot operation comprises a vertical pivot, 
said vertical pivot comprising: 

combining sub-tables into a table by arranging said sub-tables vertically. 

26) The method of claim 25 further comprising: 

adding an additional column containing said at least one pivot value to label a 
group of rows comprising each sub-table. 

27) The method of claim 10 wherein said pivot operation comprises a horizontal 
pivot. 

28) The method of claim 27 wherein at least one additional horizontal pivot is nested 
with said horizontal pivot. 

29) The method of claim 10 wherein said pivot operation comprises a stack pivot. 

30) The method of claim 29 wherein at least one additional stack pivot is nested with 
said stack pivot. 
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31) The method of claim 10 wherein said pivot operation comprises a vertical pivot. 

32) The method of claim 1 wherein said layout information comprises fields for 
performing an ordering operation. 

33) The method of claim 1 wherein said layout information identifies fields to be 
shown in said preview table. 

34) The method of claim 33 wherein said layout information identifies a display 
sequence associated with said fields to be shown in said preview table. 

35) The method of claim 1 wherein said displaying said preview table occurs in real- 
time subsequent to said obtaining layout information. 

36) The method of claim 1 further comprising: 

providing contents of said preview table to a publication program. 

37) The method of claim 1 wherein said layout information identifies empty columns 
of said preview table to hide. 

38) The method of claim 1 wherein said layout information identifies cells of said 
preview table having equal values for merging. 

39) The method of claim 38 wherein said merging is with a sub-table of said preview 
table. 

40) In a computer system, a method for presenting family data comprising: 
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obtaining family data comprising a group of records related by at least one 
common value, wherein said family data comprises a subset of a hierarchical 
structure defined by a partition; 

generating a family table from said family data; 

presenting a visual representation of said family table to a user; 

obtaining layout information from a user; 

storing said layout information in said computer system independent of said 
family data; 

dynamically applying said layout information to said family table to generate 
a first preview table of said family data wherein said group of records in said first 
preview table depends on said layout information; 

presenting a visual representation of said first preview table to said user. 

41) The method of claim 40 further comprising: 

obtaining additional layout information from said user; 

dynamically applying said additional layout information to said first preview 

table; 

regenerating said first preview table to form a second preview table, wherein 
said group of records in said second preview table depends on said additional 
layout information. 

42) The method of claim 40 further comprising: 

obtaining modified layout information from said user, wherein said modified 
layout information differs from said layout information; 
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dynamically applying said modified layout information to said table; 

regenerating said first preview table to form a second preview table, wherein 
said group of records in said second preview table depends on said at modified layout 
information. 

43) The method of claim 40 wherein said step of dynamically applying said layout 
information executes in real-time upon obtaining said layout information from said 
user. 

44) The method of claim 40 wherein said obtaining family data comprising a group 
of records related by at least one common value comprises: 

obtaining a taxonomy comprising at least one category field, said taxonomy 
comprising said hierarchical structure; 

utilizing said at least one category field to define said partition in a 
partitioning hierarchy; 

layering said partitioning hierarchy on top of said taxonomy. 

45) The method of claim 44 further comprising: 

defining an additional partition on said partitioning hierarchy. 

46) The method of claim 44 where a pivot operation is performed at any level in said 
partitioning hierarchy. 

47) The method of claim 46 wherein said layout information comprises inheritance 
properties associated with child nodes of said partitioning hierarchy. 
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48) The method of claim 47 wherein said layout information comprises information 
for overriding said inheritance properties on a node-by-node basis. 

49) The method of claim 40 wherein said layout information comprises at least one 
pivot operation, said at least one pivot operation providing a mechanism for reducing 
redundant information shown in said first preview table. 

50) The method of claim 49 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one stack pivot. 

51) The method of claim 50 wherein at least one additional stack pivot is nested with 
said at least one stack pivot. 

52) The method of claim 49 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one horizontal pivot. 

53) The method of claim 52 wherein at least one additional horizontal pivot is nested 
with said horizontal pivot. 

54) The method of claim 39 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one vertical pivot. 

55) The method of claim 54 wherein at least one additional vertical pivot is nested 
with said vertical pivot. 

56) The method of claim 49 wherein said at least one pivot operation comprises: 
identifying at least one pivot axis in a table wherein said pivot axis corresponds 

to said at least one pivot value; 
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removing said at least one pivot axis from said table; 

generating said first preview table by breaking said first preview table into sub- 
tables based on said at least one pivot axis. 

57) The method of claim 56 wherein said pivot axis comprises a row in said table. 

58) The method of claim 56 wherein said pivot axis comprises a column in said table. 

59) The method of claim 56 further comprising: 

sorting said family data into said sub-tables based on said at least one pivot 
value of said pivot axis. 

60) The method of claim 56 wherein said at least one pivot operation comprises a 
horizontal pivot comprising: 

combining said sub-tables into said first preview table by arranging said sub- 
tables horizontally. 

61) The method of claim 60 further comprising: 

adding an additional row to said preview table comprising said at least one pivot 
value, said additional row labeling said sub-tables. 

62) The method of claim 56 wherein said at least one pivot operation comprises a 
stack pivot comprising: 

combining said sub-tables into said first preview table by arranging said sub- 
tables vertically. 
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63) The method of claim 62 further comprising adding an additional row to said first 

preview table containing said at least one pivot value before each of said sub-tables. 

64) The method of claim 62 further comprising: 

preserving each of said sub-tables; 

labeling said sub-tables with said at least one pivot value. 

65) The method of claim 56 wherein said pivot operation comprises a vertical pivot, 
said vertical pivot comprising: 

combining sub-tables into said first preview table by arranging said sub-tables 
vertically. 

66) The method of claim 65 further comprising: 

adding an additional column containing said at least one pivot value to label a 
group of rows comprising each sub-table. 

67) In a computer system, a method for presenting family data comprising: 

obtaining family data comprising a group of records related by at least one 
common value, wherein said family data comprises a subset of a hierarchical 
structure defined by a partition; 

generating a family table from said family data; 

presenting a visual representation of said family table to a user; 

obtaining at least one pivot value from a user, wherein said at least one pivot 
value comprise data associated with said family table; 
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storing said at least one pivot value in said computer system independent of 
said family data; 

dynamically applying said at least one pivot value to said family table during 
at least one pivot operation, wherein said at least one pivot operation generates a 
first preview table of said family data wherein said group of records in said first 
preview table depends on said at least one pivot value; 

presenting a visual representation of said first preview table to said user. 

68) The method of claim 67 further comprising: 

obtaining at least one additional pivot value from said user; 
dynamically applying said at least one additional pivot value to said first 
preview table; 

regenerating said first preview table to form a second preview table, wherein 
said group of records in said second preview table depends on said at least one 
additional pivot value. 

69) The method of claim 67 further comprising: 

obtaining at least one different pivot value from said user, wherein said at 
least one different pivot value differs from said at least one pivot value; 

dynamically applying said at least one different pivot values to said table; 

regenerating said first preview table to form a second preview table, wherein 
said group of records in said second preview table depends on said at least one 
different pivot value. 
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70) The method of claim 67 wherein said step of dynamically applying said at least 
one pivot value executes in real-time upon obtaining said at least one pivot value. 

71) The method of claim 67 wherein said obtaining family data comprising a group 
of records related by at least one common value comprises: 

obtaining a taxonomy comprising at least one category field, said taxonomy 
comprising said hierarchical structure; 

utilizing said at least one category field to define said partition in a 
partitioning hierarchy; 

layering said partitioning hierarchy on top of said taxonomy. 

72) The method of claim 71 further comprising: 

defining an additional partition on said partitioning hierarchy. 

73) The method of claim 71 where said at least one pivot operation is performed at 
any level in said partitioning hierarchy. 

74) The method of claim 67 wherein said step of obtaining at least one pivot value 
from said user comprises: 

obtaining layout information. 

75) The method of claim 74 wherein said layout information comprises inheritance 
properties associated with child nodes of said portioning hierarchy. 

76) The method of claim 75 wherein said layout information comprises information 
for overriding said inheritance properties on a node-by-node basis. 
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77) The method of claim 67 wherein said at least one pivot operation reduces 
redundant information shown in said first preview table. 

78) The method of claim 67 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one stack pivot. 

79) The method of claim 78 wherein at least one additional stack pivot is nested with 
said at least one stack pivot. 

80) The method of claim 67 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one horizontal pivot. 

81) The method of claim 80 wherein at least one additional horizontal pivot is nested 
with said horizontal pivot. 

82) The method of claim 67 wherein said at least one pivot operation utilizes said at 
least one pivot value to execute at least one vertical pivot. 

83) The method of claim 67 wherein said at least one pivot operation comprises: 
identifying at least one pivot axis in a table wherein said pivot axis corresponds 

to said at least one pivot value; 

removing said at least one pivot axis from said table; 

generating said first preview table by breaking said first preview table into sub- 
tables based on said at least one pivot axis. 

84) The method of claim 83 wherein said pivot axis comprises a row in said table. 

85) The method of claim 83 wherein said pivot axis comprises a column in said table. 
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86) The method of claim 83 further comprising: 

sorting said family data into said sub-tables based on said at least one pivot 
value of said pivot axis. 

87) The method of claim 83 wherein said at least one pivot operation comprises a 
horizontal pivot comprising: 

combining said sub-tables into said first preview table by arranging said sub- 
tables horizontally. 

88) The method of claim 87 further comprising: 

adding an additional row to said preview table comprising said at least one pivot 
value, said additional row labeling said sub-tables. 

89) The method of claim 83 wherein said at least one pivot operation comprises a 
stack pivot comprising: 

combining said sub-tables into said first preview table by arranging said sub- 
tables vertically. 

90) The method of claim 89 further comprising adding an additional row to said first 
preview table containing said at least one pivot value before each of said sub-tables. 

91) The method of claim 89 further comprising: 

preserving each of said sub-tables; 

labeling said sub-tables with said at least one pivot value. 
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92) The method of claim 83 wherein said pivot operation comprises a vertical pivot, 
said vertical pivot comprising: 

combining sub-tables into said first preview table by arranging said sub-tables 
vertically. 

93) The method of claim 92 further comprising: 

adding an additional column containing said at least one pivot value to label a 
group of rows comprising each sub-table. 

94) A computer program product comprising: 

a computer usable medium having computer readable program code configured 
to format table data embodied therein, said computer readable program code 
configured to: 

display in a graphical user interface a table having at least one field name and 
records associated with said at least one field name; 

display in said graphical user interface a layout component comprising a 
plurality of formatting commands for manipulating said table; 

display said field names in said layout component; 

obtain input from a user wherein said input comprises at least one field 

name; 

dynamically execute at least one of said plurality of formatting commands 
using said at least one field name, wherein said formatting command modifies said 
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table to generate a preview table comprising less redundant information in said 
preview table's records than said records of said table; 

display said preview table to said user in said graphical user interface. 

95) The computer program product of claim 94 wherein said computer readable 
program code configured to dynamically modify said table comprises at least one pivot 
operation. 

96) The computer program product of claim 94 wherein at least one of said 
formatting commands comprises an inheritance property associated with said at least 
one pivot operation. 

97) The computer program product of claim 94 wherein at least one of said 
formatting commands comprises an inherit display order command. 

98) The computer program product of claim 94 wherein said dynamically modifying 
said table to generate said preview table comprising less redundant information 
comprises merging each of said plurality of rows having duplicate information. 

99) The computer program product of claim 94 wherein at least one of said plurality 
of formatting commands comprises a stack pivot command and said at least one field 
name comprises a value utilized to perform a stack pivot on said table. 

100) The computer program product of claim 94 wherein at least one of said plurality 
of formatting commands comprises a horizontal pivot command and said at least one 
field name comprises a value utilized to perform a horizontal pivot on said table. 
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101) The computer program product of claim 94 wherein at least one of said plurality 
of formatting commands comprises a vertical pivot command and said at least one field 
name comprises a value utilized to perform a vertical pivot on said table. 

102) The computer program product of claim 94 wherein said dynamically modifying 
said table to generate said preview table comprising less redundant information 
comprises sorting said table. 

103) The computer program product of claim 94 wherein said table comprises family 
data. 

104) The computer program product of claim 94 wherein said user can remove said at 
least one field name using said layout component and said computer readable program 
code dynamically modifies said preview table accordingly. 

105) The computer program product of claim 94 wherein said computer readable 
program code identifies said at least one field as a hidden filed and removes a view of 
said at least one field from said preview table. 

106) An apparatus for formatting table data comprising: 
a processor; 

memory coupled to said processor- 
said memory comprising a module configured to obtain a group of records, 
obtain layout information from a user wherein said layout information is stored 
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independent of said group of records, and dynamically apply said layout information to 
said first table to generate a preview table; 

a display mechanism configured to present said preview table to said user. 

107) The apparatus of claim 106 wherein said apparatus generates said preview table 
in real time upon receipt of said layout information. 

108) The apparatus of claim 106 wherein said group of records comprises a plurality 
of fields. 

109) The apparatus of claim 106 wherein said group of records comprises a plurality 
of attributes. 

110) The apparatus of claim 106 wherein said group of records are related by at least 
one common value. 

111) The apparatus of claim 110 wherein said at least one common value represents at 
least one value from said group of records 

112) The apparatus of claim 106 wherein said layout information defines hidden 
fields associated with said preview table. 

113) The apparatus of claim 106 wherein said layout information defines hidden 
attributes associated with said preview table. 

114) The apparatus of claim 106 wherein said layout information comprises at least 
one pivot value. 
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115) The apparatus of claim 106 wherein said at least one pivot value is hidden when 
said display mechanism displays said preview table. 

116) The apparatus of claim 106 wherein said at least one pivot value is from said 
group of records. 

117) The apparatus of claim 106 wherein said pivot operation reduces redundant 
information displayed in said preview table. 

118) The apparatus of claim 106 wherein said module is configured to identify at least 
one pivot axis in a table wherein said pivot access corresponds to said at least one pivot 
value, remove said at least one pivot axis from said table, and generate said preview 
table by breaking said preview table into sub-tables based on said at least one pivot 
axis. 

119) The apparatus of claim 118 wherein said pivot axis comprises a row in said table. 

120) The apparatus of claim 118 wherein said pivot axis comprises a column in said 
table. 

121) The apparatus of claim 118 wherein said module is further configured to sort 
said group of records from said table into said sub-tables based on said at least one 
pivot value of said pivot axis. 

122) The apparatus of claim 118 wherein said module is further configured to 
combine said sub-tables into said preview table by arranging said sub-tables 
horizontally. 
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123) The apparatus of claim 118 wherein said module is further configured to add an 

additional row to said preview table comprising said at least one pivot value, said 
additional row labeling said sub-tables. 

124) The apparatus of claim 118 wherein said module is further configured to 
combine said sub-tables into said preview table by arranging said sub-tables vertically, 

125) The apparatus of claim 124 wherein said module is further configured to add an 
additional row to said preview table comprising said at least one pivot value before 
each of said sub-tables. 

126) The apparatus of claim 125 wherein said module is further configured to 
preserve each of said sub-tables and label said sub-tables with said at least one pivot 
value. 

127) The apparatus of claim 118 wherein said module is further configured to 
combine said sub-tables into a table by arranging said sub-tables vertically. 

128) The apparatus of claim 127 wherein said module is further configured to add an 
additional column comprising said at least one pivot value to label a group of rows 
comprising each sub-table. 

129) The apparatus of claim 106 wherein said layout information comprises fields for 
enabling said module to perform a sorting operation. 

130) The apparatus of claim 106 wherein said layout information identifies fields to be 
shown in said preview table when said display mechanism presents said preview table. 
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131) The apparatus of claim 106 wherein said layout information identifies a display 
sequence associated with said fields to be shown in said preview table when said 
display mechanism presents said preview table. 

132) The apparatus of claim 106 wherein said display mechanism presents said 
preview table in real-time subsequent to said obtaining layout information. 

133) The apparatus of claim 106 further comprising: 

a publication module, said publication module configured to prepare said 
preview table for publication. 

134) The apparatus of claim 106 wherein said layout information identifies empty 
columns of said preview table to hide. 

135) The apparatus of claim 106 wherein said layout information identifies cells of 
said preview table having equal values for said module to merge. 

136) In a computer system, a method for formatting table data comprising: 

obtaining a group of records; 
obtaining pivot information from a user; 

dynamically applying said pivot information to said first table to generate a 
preview table; 

displaying said preview table to said user. 
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137) The method of claim 136 wherein said user modifies said pivot information and 
said computer system iteratively executes said step of dynamically applying said pivot 
information to said first table to generate said preview table. 

138) The method of claim 136 wherein said group of records comprises a plurality of 
fields. 

139) The method of claim 136 wherein said group of records comprises a plurality of 
attributes. ' 

140) The method of claim 136 wherein said group of records are related by at least one 
common value. 

141) The method of claim 140 wherein said at least one common value represents at 
least one value from said group of records! 

142) The method of claim 136 further comprising: 

displaying at least a subset of said group of records in a table. 

143) The method of claim 136 wherein said pivot information comprises hidden 
fields. 

144) The method of claim 136 wherein said pivot information defines hidden 
attributes. 

145) The method of claim 136 wherein said pivot information comprises at least one 
pivot value associated with a pivot operation. 
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146) The method of claim 145 wherein said at least one pivot value is hidden from 
said preview table. 

147) The method of claim 145 wherein said at least one pivot value is from said group 
of records. 

148) The method of claim 145 wherein said pivot operation reduces redundant 
information displayed in said preview table. 

149) The method of claim 145 wherein said pivot operation comprises: 
identifying at least one pivot axis in a table wherein said pivot access 

corresponds to said at least one pivot value; 

removing said at least one pivot axis from said table; 

generating said preview table by breaking said preview table into sub-tables 
based on said at least one pivot axis. 

150) The method of claim 149 wherein said pivot axis comprises a row in said table. 

151) The method of claim 149 wherein said pivot axis comprises a column in said 
table. 

152) The method of claim 149 further comprising: 

sorting said group of records from said table into said sub-tables based on said at 
least one pivot value of said pivot axis. 

153) The method of claim 145 wherein said pivot operation comprises a horizontal 
pivot comprising: 
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combining said sub-tables into said preview table by arranging said sub-tables 
horizontally. 



154) The method of claim 153 further comprising: 

adding an additional row to said preview table comprising said at least one pivot 
value, said additional row labeling said sub-tables. 

155) The method of claim 145 wherein said pivot operation comprises a stack pivot 
comprising: 

combining said sub-tables into said preview table by arranging said sub-tables 
vertically. 

156) The method of claim 155 further comprising adding an additional row to said 
preview table containing said at least one pivot value before each of said sub-tables. 

157) The method of claim 155 further comprising: 

preserving each of said sub-tables; 

labeling said sub-tables with said at least one pivot value. 

158) The method of claim 145 wherein said pivot operation comprises a vertical pivot, 
said vertical pivot comprising: 

combining sub-tables into a table by arranging said sub-tables vertically. 

159) The method of claim 158 further comprising: 
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adding an additional column containing said at least one pivot value to label a 

group of rows comprising each sub-table. 

160) The method of claim 145 wherein said pivot operation comprises a horizontal 
pivot. 

161) The method of claim 160 wherein at least one additional horizontal pivot is 
nested with said horizontal pivot. 

162) The method of claim 145 wherein said pivot operation comprises a stack pivot. 

163) The method of claim 162 wherein at least one additional stack pivot is nested 
with said stack pivot. 

164) The method of claim 145 wherein said pivot operation comprises a vertical pivot. 

165) The method of claim 136 wherein said pivot information comprises fields for 
performing an ordering operation. 

166) The method of claim 136 wherein said pivot information identifies fields to be 
shown in said preview table. 

167) The method of claim 136 wherein said pivot information identifies a display 
sequence associated with said fields to be shown in said preview table. 

168) The method of claim 136 wherein said displaying said preview table occurs in 
real-time subsequent to said obtaining pivot information. 

169) The method of claim 136 further comprising: 
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providing contents of said preview table to a publication program. 

170) The method of claim 136 wherein said pivot information is associated with said 
group of records. 

171) The method of claim 170 wherein said pivot information has a dependent 
relationship with said group of records. 
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